Micromonospora echinospora genes encoding for biosynthesis of
calicheamicin and self-resistance thereto 

This application is a continuation-in-part of the non-provisional application 
09/457045, filed December 7, 1999 and claims benefit thereof, which application is 
incorporated herein by reference in its entirety. This application also claims benefit from 
provisional application 60/111,325 filed on December 7, 1998, which application is
incorporated herein by reference in its entirety. 

Field of the Invention 

The present invention relates to a biosynthetic gene cluster of Micromonospora 
echinospora spp. calichensis. In particular, the calicheamicin biosynthetic gene cluster 
contains genes encoding for proteins and enzymes used in the biosynthetic pathway and 
construction of calicheamicin's aryltetrasaccharide and aglycone, and the gene conferring
calicheamicin resistance. The present invention also relates to isolated genes of the 
biosynthetic cluster and their corresponding proteins. In addition, the invention relates to 
DNA hybridizing with the calicheamicin gene cluster and the isolated genes of that cluster. 
The invention also relates to expression vectors containing the biosynthetic gene cluster, 
the individual genes, or functional variants thereof. 

Background of the Invention 

The enediyne antibiotics, which were discovered in the 1980's, have long been 
appreciated for their novel molecular architecture, their remarkable biological activity, and 
their fascinating mode of action. Enediyne antibiotics were originally derived by
fermentation of microorganisms, including Micromonospora, Actinomadura, and
Streptomyces. Rothstein, D. M., Enediyne Antibiotics as Antitumor Agents, p. 2 (1995). 
As a class, the enediyne antibiotics have been referred to as the most potent and highly 
active antitumor reagents yet discovered. Rothstein, D. M., Enediyne Antibiotics as 
Antitumor Agents, preface (1995). 

To date, at least twelve members of this family of antibiotics have been discovered, 
all of which fall roughly into two categories. The members of the first category of 
enediynes are classified as chromoprotein enediynes because they possess a novel 9- 
membered ring chromophore core structure, which also requires a specific associated 
protein for chromophore stabilization. The members of the second category of enediyne 
are classified as non-chromoprotein enediynes. These enediynes contain a 10-membered
ring, which requires no additional stabilization factors. This enediyne ring structure is 
often referred to as the "warhead." The warhead induces DNA damage, frequently a
double-stranded cleavage, which the cell usually cannot repair and which is most often
lethal. Because of these remarkable
chemical and biological properties, there has been an intense effort by both the 
pharmaceutical industry and academia to study these substances with the goal of 
developing new and clinically useful therapeutic anti-tumor agents. 

The 9-membered ring chromoprotein enediyne subfamily is comprised of:
neocarzinostatin from Streptomyces carzinostaticus (Myers, A.G., et al., J. Am. Chem.
Soc., 110, 7212-7214 (1988)); kedarcidin from Actinomycete L585-6 (Leet, J.E., et al., J.
Am. Chem. Soc., 114, 7946-7948 (1992)); N1999A2 from Streptomyces globisporus
(Yoshida, K., et al., Tetrahedron Lett., 34, 2637-2640 (1993)); maduropeptin from
Actinomadura madurae (Schroeder, D.R., et al., J. Am. Chem. Soc., 116, 9351-9352
(1994)); N1999A2 from Streptomyces sp. AJ9493 (Schroeder, D.R., et al., J. Am. Chem.
Soc., 116, 9351-9352 (1994)); actinoxanthin from Actinomyces globisporus (Khokhlov,
A.S., et al., J. Antibiot., XXII, 541-544 (1969)); largomycin from Streptomyces
pluricolorescens (Yamaguchi, T., et al., J. Antibiot., XXIII, 369-372 (1970));
auromomycin from Streptomyces macromomyceticus (Yamashita, T., et al., J. Antibiot.,
XXXII, 330-339 (1979)); and sporamycin from Streptosporangium pseudovulgare
(Komiyama, K., et al., J. Antibiot., XXX, 202-208 (1977)), all of which are believed to
possess a novel bicyclo[7.3.0]dodecadiynene chromophore core structure essential for
biological activity. In addition, with the exception of N1999A2, a required apoprotein acts
as a stabilizer and specific carrier for the unstable chromophore, and for its transport and
interaction with target DNA.

The non-chromoprotein enediyne subfamily is comprised of calicheamicin from
Micromonospora echinospora spp. calichensis; namenamicin from Polysyncraton 
lithostrotum; esperamicin from Actinomadura verrucosospora; and dynemicin from 
Micromonospora chersina. 

Enediyne antibiotics have potential as anticancer agents because of their ability to 
cleave DNA; however, many of these compounds are currently too toxic to be used in
clinical studies. To date, only calicheamicin is known to be in use in clinical trials,
and it has provided promising results as an anticancer agent. For example, MyloTarg™, a
calicheamicin-antibody conjugate also known as CMA-676, was approved by the FDA in
January of 2000 to treat acute myelogenous leukemia. The enediynes also potentially have
utility as anti-infective agents, provided that toxicity can be managed. 

Calicheamicin has two distinct structural regions: the aryltetrasaccharide and the
aglycone (also known as the warhead). The aryltetrasaccharide displays a highly unusual
series of glycosidic, thioester, and hydroxylamine linkages and serves to deliver the drug
primarily to specific tracts (5'-TCCT-3' and 5'-TTTT-3') within the minor groove of DNA
when those sequences are available. However, specificity is also context-dependent. The
aglycone of calicheamicin consists of a highly functionalized bicyclo[7.3.1]tridecadiynene
core structure with an allylic trisulfide serving as the triggering mechanism. McGahren,
W.J., et al., Enediyne Antibiotics as Antitumor Agents, pp. 75-86 (1995). Once the
aryltetrasaccharide is firmly docked, aromatization of the bicyclo[7.3.1]tridecadiynene
core structure, via a 1,4-dehydrobenzene-diradical, results in the site specific oxidative
double strand scission of the targeted DNA. Zein, N., et al., Science, 240, 1198-1201
(1988). The aglycone undergoes a reaction that yields carbon-centered diradicals, which 
are responsible for DNA cleavage. 
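
As a rough illustration of the tract preference described above, the following sketch scans a DNA sequence for the two preferred recognition tracts (5'-TCCT-3' and 5'-TTTT-3') on both strands. It is a plain string search under simplifying assumptions: the function names are hypothetical, the input is assumed to contain only A, C, G and T, and the context dependence of binding noted in the text is not modeled.

```python
# Illustrative sketch only: a literal motif search, not a binding model.
TRACTS = ("TCCT", "TTTT")  # preferred tracts named in the text, written 5'->3'

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    comp = {"A": "T", "T": "A", "G": "C", "C": "G"}
    return "".join(comp[base] for base in reversed(seq.upper()))

def find_tracts(seq: str, tracts=TRACTS):
    """Return sorted (start, strand, tract) hits.

    start is the 0-based index, on the given (top) strand, of the leftmost
    base covered by the hit; strand is "+" for the given strand and "-"
    for its reverse complement. Overlapping hits are all reported.
    """
    top = seq.upper()
    bottom = reverse_complement(top)
    n = len(top)
    hits = []
    for tract in tracts:
        k = len(tract)
        for strand, s in (("+", top), ("-", bottom)):
            i = s.find(tract)
            while i != -1:
                # Map bottom-strand indices back to top-strand coordinates.
                start = i if strand == "+" else n - i - k
                hits.append((start, strand, tract))
                i = s.find(tract, i + 1)
    return sorted(hits)
```

For example, `find_tracts("GGTCCTAAAAGG")` reports a TCCT tract on the given strand and a TTTT tract on the complementary strand (opposite the AAAA run).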

This activity of calicheamicin has sparked considerable interest in the 
pharmaceutical industry culminating in the recent FDA approval of the calicheamicin- 
antibody conjugate MyloTarg™ (CMA-676) to treat acute myelogenous leukemia (AML). 
Additionally, similar strategies have been used in phase I trials to treat breast cancer. A 
massive program to examine calicheamicin conjugated to alternative delivery systems has 
also recently been undertaken. Hamann, P.R., et al., 87th Annual Meeting of the American
Association of Cancer Research, Washington, D.C., p. 471 (1996); Hinman, L.M., et al.,
Cancer Res., 53, 3336 (1993); Hinman, L.M., et al., Enediyne Antibiotics as Antitumor
Agents, pp. 87-105 (1995); Sievers, E.L., et al., Blood, 93, 3678-3684 (1999); Siegel,
M.M., et al., Anal. Chem., 69, 2716-2726 (1997); Ellestad, G., personal communication.

The biological activity and molecular architecture of calicheamicin have also
prompted a search for potentially useful analogs. Of the numerous laboratories producing 
synthetic analogs, one group has produced a novel calicheamicin y\ shown to effectively 
suppress growth and dissemination of liver metastases in a syngeneic model of murine 
neuroblastoma. Lode, H. N., et al., Cancer Res., 58, 2925-2928 (1998); Wrasidlo, W., et 
al., Acta Oncologica, 34, 157-164 (1995). In addition to synthesizing calicheamicin
analogs, random mutagenesis of M. echinospora and screening for mutant strains with 
improved biosynthetic potential has also been pursued. Rothstein, D. M., Enediyne 
Antibiotics as Antitumor Agents, pp. 107-126 (1995). 

The first total synthesis of calicheamicin was reported by Nicolaou and coworkers 
in 1992. Synthesizing this complex antibiotic, though, presents many disadvantages. For 
example, Nicolaou's procedure only provides approximately a 0.007% yield and requires 47
steps. Halcomb, R.L., Enediyne Antibiotics as Antitumor Agents, pp. 383-439 (1995). 
Thus, the total synthesis of calicheamicin remains secondary to the isolation of 
calicheamicin from large fermentations of M. echinospora. Therefore, methods to produce
large amounts of calicheamicin and potentially useful variants are still needed. Fantini, A.,
et al., Enediyne Antibiotics as Antitumor Agents, pp. 29-48 (1995). Transforming 
calicheamicin DNA into producing strains of bacteria, such as Streptomyces, 
Micromonospora, other actinomyces species, or E. coli, as non-limiting examples, would 
address this need. However, prior to the discoveries of the present inventors, no cloned M.
echinospora genes were available, and only a limited set of studies upon putative M.
echinospora promoters was available. Lin, L.S., et al., J. Gen. Microbiol., 138, 1881-1885
(1992); Lin, L.S., et al., J. Bacteriol., 174, 3111-3117 (1992); Baum, E.Z., et al., J.
Bacteriol., 171, 6503-6510 (1989); Baum, E.Z., et al., J. Bacteriol., 170, 71-77 (1988).
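
To put the quoted synthesis figures in perspective, the short calculation below estimates the average per-step yield implied by an overall yield of approximately 0.007% over 47 steps. It assumes, purely for illustration, that all 47 steps form a single linear sequence with equal yields; the cited source does not state this, and the function name is hypothetical.

```python
def average_step_yield(overall_yield_fraction: float, steps: int) -> float:
    """Geometric-mean yield per step for a linear sequence of equal-yield steps."""
    if not (0.0 < overall_yield_fraction <= 1.0) or steps < 1:
        raise ValueError("expected a yield fraction in (0, 1] and at least one step")
    return overall_yield_fraction ** (1.0 / steps)

# Figures quoted in the text: approximately 0.007% overall yield, 47 steps.
OVERALL_YIELD = 0.007 / 100   # percent converted to a fraction
STEPS = 47

per_step = average_step_yield(OVERALL_YIELD, STEPS)
print(f"average yield per step: {per_step:.1%}")
```

Under that assumption the average comes out a little above 80% per step, which shows how even respectable individual yields compound into a very small overall yield across a long sequence.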

Calicheamicin's molecular architecture, in conjunction with its useful biological
activity and potential therapeutic value, makes calicheamicin a target for the study of
natural product biosynthesis. While the radical-based mechanism of oxidative DNA
cleavage by calicheamicin (i.e., aromatization of the bicyclo[7.3.1]tridecadiynene core
structure, via a 1,4-dehydrobenzene-diradical, resulting in the site specific oxidative
double strand DNA cleavage) is well understood, it was unknown, prior to this invention, 
how Micromonospora constructs calicheamicin. As a result, before the present invention, 
there was a need to discover and understand calicheamicin biosynthesis. Prior to this 
discovery of the present inventors, knowledge of genes encoding for nonchromoprotein 
enediyne biosynthesis was completely lacking. 

The toxicity of the enediyne compounds, including calicheamicin, centers on the
problem of directing the compound to cleave only the DNA of interest, such as tumor
cell DNA, and not the DNA of the host. Due to calicheamicin's powerful ability to cleave
DNA, scientists have investigated the mechanism by which the calicheamicin-producing
organism protects itself against the DNA-cleaving activity of the molecule. Rothstein, D.
M., Enediyne Antibiotics as Antitumor Agents, p. 77 (1995). Prior to this invention, 
knowledge of genes encoding for non-chromoprotein enediyne self resistance was 
completely lacking. 

Summary of the Invention 

The present invention relates to the first identification, isolation, and cloning of a 
nonchromoprotein enediyne biosynthetic gene cluster and mapping and nucleotide 
sequence analysis of the genes within the cluster. The invention provides the entire
calicheamicin-biosynthetic cluster and biochemical studies of aryltetrasaccharide 
biosynthesis. Furthermore, the calicheamicin self-resistance gene and protein have been 
isolated, as have the genes and resulting enzymes for steps within the calicheamicin 
cascade. The invention also provides for construction of enediyne overproducing strains, 
for rational biosynthetic modification of bioactive secondary metabolites, for new drug 
leads, and for an enediyne combinatorial biosynthesis program. 

The present invention provides an isolated nucleic acid molecule from a 
nonchromoprotein enediyne biosynthetic gene cluster from Micromonospora echinospora 
comprising said nucleic acid molecule, a portion or portions of said nucleic acid molecule 
wherein said portion or portions encode a protein, or a portion or portions of said nucleic
acid molecule wherein said portion or portions encode a biologically active fragment of a
protein. The isolated nucleic acid molecule may be single- or double-stranded. As used 
herein, a nucleic acid molecule, polypeptide, or protein described as being "from" e.g., an 
organism or gene cluster, may have been isolated from such organism or gene cluster; 
alternatively, it may be a molecule which has been produced using synthetic, chemical, 
recombinant, or other such methods and comprise an amino acid or nucleotide sequence 
which may be isolated from such organism or gene cluster. 

The present invention provides forty-eight genes, twenty-seven of which encode
structural genes with the remainder encoding a variety of functions. The present invention
is drawn to the following genes or nucleic acids: calC (SEQ ID No. 1), calH (SEQ ID No.
3), calG (SEQ ID No. 5), calA (SEQ ID No. 7), calB (SEQ ID No. 9), calD (SEQ ID No.
11), calF (SEQ ID No. 13), calI (SEQ ID No. 15), calJ (SEQ ID No. 17), calK (SEQ ID
No. 19), calL (SEQ ID No. 21), calM (SEQ ID No. 23), calN (SEQ ID No. 25), calO (SEQ
ID No. 27), calP (SEQ ID No. 29), calQ (SEQ ID No. 31), calR (SEQ ID No. 33), calS
(SEQ ID No. 35), calT (SEQ ID No. 37), calU (SEQ ID No. 39), calV (SEQ ID No. 41),
calW (SEQ ID No. 43), calX (SEQ ID No. 45), 6MSAS (SEQ ID No. 47), ActI (SEQ ID
No. 49), ActII (SEQ ID No. 51), ActIII (SEQ ID No. 53), orf1 (SEQ ID No. 55), orf2 (SEQ
ID No. 57), orf3 (SEQ ID No. 59), orf4 (SEQ ID No. 61), orf5 (SEQ ID No. 63), orf6
(SEQ ID No. 65), orf7 (SEQ ID No. 67), orf8 (SEQ ID No. 69), orfI (SEQ ID No. 71),
orfII (SEQ ID No. 73), orfIII (SEQ ID No. 75), orfIV (SEQ ID No. 77), orfV (SEQ ID No.
79), orfVI (SEQ ID No. 81), orfVII (SEQ ID No. 83), orfVIII (SEQ ID No. 85), orfIX
(SEQ ID No. 87), orfX (SEQ ID No. 89), orfXI (SEQ ID No. 91), IS-element (DNA) (SEQ
ID No. 93), calE (SEQ ID No. 94). The invention is also drawn to the following proteins
or putative proteins: CalC (SEQ ID No. 2), CalH (SEQ ID No. 4), CalG (SEQ ID No. 6),
CalA (SEQ ID No. 8), CalB (SEQ ID No. 10), CalD (SEQ ID No. 12), CalF (SEQ ID No.
14), CalI (SEQ ID No. 16), CalJ (SEQ ID No. 18), CalK (SEQ ID No. 20), CalL (SEQ ID
No. 22), CalM (SEQ ID No. 24), CalN (SEQ ID No. 26), CalO (SEQ ID No. 28), CalP
(SEQ ID No. 30), CalQ (SEQ ID No. 32), CalR (SEQ ID No. 34), CalS (SEQ ID No. 36),
CalT (SEQ ID No. 38), CalU (SEQ ID No. 40), CalV (SEQ ID No. 42), CalW (SEQ ID
No. 44), CalX (SEQ ID No. 46), 6MSAS (SEQ ID No. 48), ActI (SEQ ID No. 50), ActII
(SEQ ID No. 52), ActIII (SEQ ID No. 54), Orf1 (SEQ ID No. 56), Orf2 (SEQ ID No. 58),
Orf3 (SEQ ID No. 60), Orf4 (SEQ ID No. 62), Orf5 (SEQ ID No. 64), Orf6 (SEQ ID No.
66), Orf7 (SEQ ID No. 68), Orf8 (SEQ ID No. 70), OrfI (SEQ ID No. 72), OrfII (SEQ ID
No. 74), OrfIII (SEQ ID No. 76), OrfIV (SEQ ID No. 78), OrfV (SEQ ID No. 80), OrfVI
(SEQ ID No. 82), OrfVII (SEQ ID No. 84), OrfVIII (SEQ ID No. 86), OrfIX (SEQ ID No.
88), OrfX (SEQ ID No. 90), OrfXI (SEQ ID No. 92), CalE (SEQ ID No. 95).
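
The numbering pattern in the two lists above can be summarized in a few lines. The sketch below is only a bookkeeping aid inferred from the lists as printed: nucleic acid sequences carry the odd SEQ ID numbers 1 through 93 plus 94, each odd-numbered gene from 1 through 91 pairs with the next even number for its protein, the IS-element (93) has no listed protein, and calE (94) pairs with CalE (95). The function and constant names are hypothetical.

```python
from typing import Optional

def protein_seq_id(nucleotide_seq_id: int) -> Optional[int]:
    """Map a nucleic acid SEQ ID number to its listed protein SEQ ID number.

    Returns None for the IS-element (SEQ ID No. 93), for which the text
    lists a DNA sequence but no protein.
    """
    if nucleotide_seq_id == 94:            # calE -> CalE
        return 95
    if nucleotide_seq_id == 93:            # IS-element (DNA)
        return None
    if 1 <= nucleotide_seq_id <= 91 and nucleotide_seq_id % 2 == 1:
        return nucleotide_seq_id + 1       # e.g. calC (1) -> CalC (2)
    raise ValueError(f"not a listed nucleic acid SEQ ID: {nucleotide_seq_id}")

# Nucleic acid SEQ IDs as listed: odd numbers 1..93, then 94.
NUCLEOTIDE_SEQ_IDS = list(range(1, 94, 2)) + [94]

# Protein SEQ IDs derived from the pairing rule above.
PROTEIN_SEQ_IDS = [
    pid for pid in (protein_seq_id(n) for n in NUCLEOTIDE_SEQ_IDS)
    if pid is not None
]
```

The derived lists contain forty-eight nucleic acid entries, matching the count stated in the text, and forty-seven protein entries.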

In one aspect, the present invention is directed to an isolated nucleotide molecule, 
wherein the nucleotide molecule hybridizes with at least one of SEQ ID NOS: 1, 3, 5, 7, 9, 
11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57,
59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93 or 94, or a functional 
derivative of the isolated nucleotide molecule which hybridizes with at least one of SEQ 
ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45,
47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93 
or 94. In one embodiment of the invention, the isolated nucleotide molecule has the 
nucleotide sequence of at least one of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 
25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 
73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93 or 94, i.e., 100% complementarity (sequence 
identity) with at least one of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27,
29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 
77, 79, 81, 83, 85, 87, 89, 91, 93 or 94. In another embodiment of the invention, the 
isolated nucleotide molecule has at least 90% complementarity (sequence identity) with at 
least one of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35,
37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 
85, 87, 89, 91, 93 or 94. In yet another embodiment of the invention, the isolated 
nucleotide molecule has at least 80% complementarity (sequence identity) with at least one 
of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41,
43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 
91, 93 or 94. In yet another embodiment of the invention, the isolated nucleotide molecule
has at least 70% complementarity (sequence identity) with at least one of SEQ ID NOS: 1, 
3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 
53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93 or 94. In yet 
another embodiment of the invention, the isolated nucleotide molecule has at least 60% 
complementarity (sequence identity) with at least one of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13,
15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 
63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93 or 94. In still yet another 
embodiment of the invention, the isolated nucleotide molecule is substantially 
complementary to at least one of SEQ ID NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25,
27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 
75, 77, 79, 81, 83, 85, 87, 89, 91, 93 or 94. 
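
The embodiments above are tiered by percent sequence identity (100%, 90%, 80%, 70% and 60%). The text does not specify how identity is to be computed, so the following is only a minimal sketch under stated assumptions: the two sequences are already aligned to equal length, gaps are written as "-", and identity is the number of matching columns divided by the number of aligned columns (columns that are gaps in both sequences are ignored). The function names are hypothetical.

```python
THRESHOLDS = (100, 90, 80, 70, 60)  # identity tiers named in the text, in percent

def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity of two pre-aligned sequences of equal length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Drop columns where both sequences carry a gap.
    columns = [
        (x, y)
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if not (x == "-" and y == "-")
    ]
    if not columns:
        raise ValueError("no aligned columns to compare")
    # A column matches only if both residues are identical and not a gap.
    matches = sum(1 for x, y in columns if x == y and x != "-")
    return 100.0 * matches / len(columns)

def highest_threshold_met(identity_percent: float):
    """Return the highest listed tier that the identity satisfies, or None."""
    for threshold in THRESHOLDS:
        if identity_percent >= threshold:
            return threshold
    return None
```

Other conventions (for example, dividing by the length of the shorter sequence, or using a particular alignment program) would give different values, so any real comparison should state the convention used.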

In another embodiment of the invention, there is provided an isolated 
protein encoded by a DNA molecule as described herein above, or a functional derivative 
thereof. A preferred protein has the amino acid sequence of at least one of SEQ ID NOS: 
2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 
52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, or 95 or a 
functional variant or derivative of one or more of those polypeptides. 

In another embodiment, the present invention provides an isolated nucleic acid 
molecule from Micromonospora echinospora comprising a nonchromoprotein enediyne 
biosynthetic gene cluster, a portion or portions of said gene cluster wherein said portion or 
portions encode a protein, a portion or portions of said gene cluster wherein said portion 
or portions encode a biologically active fragment of a protein, a single-stranded nucleic
acid molecule derived from said gene cluster, or a single-stranded nucleic acid molecule
derived from a portion or portions of said gene cluster. 

In particular, the present invention provides an isolated nucleic acid molecule from 
Micromonospora echinospora spp. calichensis that is involved in the biosynthesis of 
calicheamicin. In another embodiment, the present invention also relates to nucleic acids 
capable of hybridizing with one or more isolated nucleic acids from a nonchromoprotein 
enediyne biosynthetic gene cluster from Micromonospora echinospora spp. calichensis. 
In a further embodiment, the invention provides an expression vector comprising an 
isolated nucleic acid molecule from a nonchromoprotein enediyne biosynthetic gene 
cluster from Micromonospora echinospora. In yet a further embodiment the invention 
provides a cosmid comprising an isolated nucleic acid molecule from a nonchromoprotein 
enediyne biosynthetic gene cluster from Micromonospora echinospora. 

In preferred embodiments, the invention provides the isolated nucleic acid 
molecules of SEQ ID Nos. 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35,
37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83,
85, 87, 89, 91, 93 and 94.

In an additional embodiment, the present invention provides a host cell transformed 
with an isolated nucleic acid molecule from a nonchromoprotein enediyne biosynthetic 
gene cluster from Micromonospora echinospora. Host cells can optionally be of bacterial, 
yeast, fungal, insect, plant or mammalian origin and can be transformed according to 
standard methods. In a preferred embodiment, the host cell is the bacterium E. coli,
Streptomyces spp. , or Micromonospora spp. In a more preferred embodiment, the host cell 
is the bacterium from the genus Streptomyces or from the genus Micromonospora. 

In a further embodiment, the invention is directed to a host cell transformed with an
expression vector comprising at least one of the nucleotide sequences of SEQ ID Nos. 1, 3,
5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53,
55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, or 94 or a
portion or portions thereof or an allele or alleles thereof. In preferred embodiments, the
host cells produce a biologically functional protein or portion of a protein, which protein or 
portion thereof is encoded by the expression vector. 

In a specific embodiment, the invention is directed to a host cell transformed with 
an expression vector comprising calC, or a portion(s) or allele(s) thereof, operably linked
to regulatory sequences that enable expression of CalC. In another specific embodiment,
the invention provides a host cell transformed with an expression vector comprising calH,
or a portion(s) or allele(s) thereof, operably linked to regulatory sequences that enable
expression of CalH. In a yet further specific embodiment, the invention provides a host
cell transformed with an expression vector comprising calQ, or a portion(s) or allele(s)
thereof, operably linked to regulatory sequences that enable expression of CalQ. Likewise, 
the invention provides a host cell transformed with an expression vector comprising calG, 
or a portion(s) or allele(s) thereof, operably linked to regulatory sequences that enable 
expression of CalG. 

In a yet further embodiment, the invention is directed to a host cell transformed 
with an expression vector encoding at least one polypeptide comprising the amino acid 
sequence of SEQ ID Nos. 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 
38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 
86, 88, 90, 92, or 95 or a functional variant of one or more of those polypeptides. In 
preferred embodiments, the host cells produce a biologically functional protein or portion
of a protein, which protein or portion thereof is encoded by the expression vector. 

In a specific embodiment, the invention is directed to a host cell transformed with 
an expression vector encoding CalC, or a functional derivative thereof, operably linked to 
regulatory sequences that enable expression of the encoded polypeptide. In another specific
embodiment, the invention provides a host cell transformed with an expression vector
encoding CalH, or a functional derivative thereof, operably linked to regulatory sequences
that enable expression of the encoded polypeptide. In yet another specific embodiment,
the invention provides a host cell transformed with an expression vector encoding CalQ, or
a functional derivative thereof, operably linked to regulatory sequences that enable
expression of the encoded polypeptide. Likewise, the invention provides a host cell
transformed with an expression vector encoding CalG, or a functional derivative
thereof, operably linked to regulatory sequences that enable expression of the encoded 
polypeptide. 

The invention further provides a method of expressing a protein by culturing a host 
cell transformed with an expression vector of the present invention, and incubating the 
host cell for a time and under conditions allowing for protein expression. 

In yet another embodiment the invention provides a method of purifying 
calicheamicin using affinity chromatography. A sample containing calicheamicin is
contacted with an affinity matrix having the protein CalC bound thereto, for a time and
under conditions allowing calicheamicin to bind to the matrix; calicheamicin is then eluted
from the matrix and recovered.

In a further embodiment the present invention provides polypeptides comprising
the amino acid sequences of SEQ ID Nos. 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 
30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 
78, 80, 82, 84, 86, 88, 90, 92 and 95. 

In yet a further embodiment the invention provides the production of the following 
two new macrolides: 



The invention further provides a method of conferring calicheamicin resistance to a 
subject comprising obtaining cells from the subject, transforming the cells with the 
calicheamicin self-resistance gene, and returning the cells to the subject. Alternatively, the 
calicheamicin self-resistance gene can be targeted and delivered to the desired host cells 
through known gene therapy delivery systems. 

The invention further provides a method of producing calicheamicin analogs by 
altering calicheamicin or its bioactive metabolites through the modulation of the 
expression of calD, E, F, G, H, J, K, N, O, P, Q, S, T, U, V, W, X, 6MSAS, actI-III, orfI,
orfIII, orfV, and orfVII. Such modulation can be achieved through selective "knock out",
as well as heterologous expression of these genes and their products. Various
combinations of these either mutated or wild type gene products may be used in either in 
vitro or in vivo calicheamicin analog production. 

The invention further provides a method for increasing the production of 
calicheamicin through the introduction of multiple copies of positive regulators and 
transporters and/or by eliminating or reducing the expression of negative regulators (e.g.,
CalA, B, I, L, Orf8). Additionally, upregulation of calicheamicin resistance genes calC, 
calN and orfXI can be used to decrease the toxicity of calicheamicin to healthy tissues and 
cells during therapy. 

In a yet further embodiment, the invention provides for a method of transposon 
mediated mutagenesis or moving chromosomal DNA fragments in vivo through 
expression of the orf3 integrase and the IS insertional element. 

The advantages of the present invention are numerous. Isolation of and the ability 
to clone calicheamicin DNA opens the door for genetic analysis of calicheamicin 
biosynthesis, as such analysis requires the ability to obtain large quantities of DNA which 
codes for calicheamicin biosynthesis. Using the teachings of the present invention, one can 
study calicheamicin biosynthesis via mutagenesis of M. echinospora. For example, one 
can isolate and characterize mutants blocked in calicheamicin biosynthesis and then 
analyze their defective or partial calicheamicin products. Additionally, a particular
enzyme or enzymes can be overexpressed or underexpressed after subcloning its gene into
a host such as E. coli, and the results of such overexpression or underexpression can be 
studied to reveal the enzyme's function. Furthermore, the cloning of biosynthetic genes 
can ultimately result in increased yields of the gene product by cloning and expressing the
biosynthetic gene encoding the rate-limiting enzyme back into the producing organism. 

Further, it may also be possible to generate novel products by cloning biosynthetic 
genes into strains that make related compounds. Such genes could endow the host 
organism with the ability to carry out new reactions on the enediyne nucleus, and thus 
produce novel drugs. The present invention thus also provides means for biosynthetic 
modification of bioactive secondary metabolites through enediyne combinatorial 
biosynthesis. As most pharmaceutical drug leads are inspired by naturally occurring 
compounds, and given the challenge posed in synthesizing these metabolites, genetic 
manipulation of the sugar appendage on the metabolites offers avenues for creating 
potential new drugs. Thus the emerging field of combinatorial biosynthesis has become a 
rich new source for modified non-natural sugar scaffolds. Marsden, A., et al., Science 
1998, 279, 199-201. Problems inherent with the genetic manipulation of the sugar 
appendage relate to the fact that naturally occurring bioactive secondary metabolites 
possess unusual carbohydrate ligands, which serve as molecular recognition elements 
critical for biological activity. Macrolide Antibiotics, Chemistry, Biology and Practice, 
1984. Without these essential sugar attachments, the biological activities of most 
clinically important secondary metabolites are either completely abolished or dramatically 
decreased. Currently, techniques for the genetic manipulation of the sugar appendage for a 
given metabolite rely mainly on the alteration and/or deletion of a small subset of genes 
required to construct and attach each desired sugar moiety. Thus there is a need to develop 
alternate strategies to construct and attach non-naturally occurring sugars. The present 
invention addresses this need. The present invention utilizes the fact that 
glycosyltransferases, which are responsible for the final glycosylation of certain secondary
metabolites, show a high degree of promiscuity toward the nucleotide sugar donor. Zhao,
L., et al., J. Am. Chem. Soc. 1998, 120, 12159-12160. This lack of selectivity of the
glycosyltransferases has the potential for allowing modification of the crucial 
glycosylation pattern of natural, or non-natural, secondary metabolite scaffolds in a 
combinatorial fashion. The present invention discloses a method using the recruitment and 
collaborative action of sugar genes from a variety of biosynthetic pathways to construct 
composite gene clusters, which make and attach non-natural sugars. 

Insight into how the Micromonospora self-resistance gene and gene products act to
control the toxic effects of calicheamicin offers new avenues of clinical research. For 
example, knowledge of the mechanisms underlying calicheamicin resistance, as provided 
by the present disclosure, can provide the means necessary to use higher doses of 
calicheamicin while simultaneously inhibiting the toxic effects of the drug on non-cancer 
cells. Additionally, understanding the mechanism behind calicheamicin's self-resistance 
may aid in the understanding of self-resistance in other enediyne antibiotics, thereby 
potentially making useful those enediynes once thought to be too toxic to be viably used as 
therapeutic agents. The calicheamicin self-resistance mechanisms elucidated utilizing the 
present invention provide gene therapy approaches, for example, via introduction of 
enediyne resistance genes into bone marrow cells, thereby increasing resistance and
allowing tolerance to chemotherapeutic doses of calicheamicin. Banerjee, D., et al., Stem 
Cells, 12, 378-385 (1994). Thus, understanding calicheamicin self-resistance will 
significantly aid continuing clinical studies involving calicheamicin and the enediynes. 
The present invention addresses this need as it provides for the isolation and
characterization of a resistance gene and its associated protein for any nonchromoprotein
enediynes.

Brief Description of the Figures 

Figure 1 summarizes the cosmid clones isolated from the M. echinospora
genomic library. This figure illustrates the results of the screening of the genomic library 
for clones carrying the calicheamicin biosynthetic cluster. 

Figure 2 shows a restriction map of a portion of cosmid clones 4b, 13a, and 56 and
the corresponding location of cal genes from M. echinospora.

Figure 3 is a table of the open reading frames ("orfs") in the calicheamicin 
biosynthetic cluster. This table lists the polypeptides that the genes encode for as well as 
their proposed or determined function in the biosynthetic pathway. (a) Assignments
based upon BLAST search at the amino acid level unless otherwise noted. (b) Highest
probability score obtained. (c) Assignment based on biochemical studies. (d) Only a portion of
the orf has been elucidated.

Figure 4 is a graph of the UV-visible absorption spectra of purified mbp-CalC. The
purified mbp-CalC was analyzed in the following solution: 52 µM mbp-CalC; 10 mM
Tris-HCl, pH 7.5. The inset shows the results of the low temperature (4.3 K) X-band EPR
analysis of CalC. 250 µM mbp-CalC containing 0.5 mol Fe per mol CalC was analyzed in
10 mM Tris-HCl, pH 7.5. The spectrometer settings were as follows: field set = 2050 G;
scan range = 4,000 G; time constant = 82 s; modulation amplitude = 16 G; microwave
power = 31 µW; frequency = 9.71 GHz; gain = 1000; determined spin quantitation = 90 ±
10 µM Fe.

Figure 4(b) provides the results of the mbp-CalC in vitro assay.

Figure 5 depicts the postulated routes for the biosynthesis of required nucleotide
sugars. The enzymes are depicted as follows: E deox = deoxygenase; E am =
aminotransferase; E ep = epimerase; E met = methyltransferase; E od = 4,6-dehydratase; E ox =
oxidase; E p = nucleotidyltransferase; E red = reductase; E sh = sulfhydryltransferase.

Figure 6 illustrates a schematic representation of the in vivo production of 
pikromycin/methymycin-calicheamicin hybrid metabolites. 

Figure 7 depicts the Streptomyces venezuelae methymycin/pikromycin gene cluster.
Eight open reading frames (desI-desVIII) in this cluster have been assigned as genes
involved in desosamine biosynthesis. This figure also depicts the hybrid pathway toward
new methymycin/pikromycin derivatives (11 and 12) produced after heterologous
expression of the calH gene of calicheamicin in an S. venezuelae mutant.

Figure 8 illustrates calicheamicin's (6) four unique sugars which are crucial to tight 
DNA binding. Sugar (9) is derived from 4-amino-4,6-dideoxyglucose (8) and is part of the
restricted N-O connection between sugars A and B. Compound 8 is derived from the 
corresponding 4-ketosugar (7) via a transamination reaction. The gene calH encodes the 
desired C-4 aminotransferase allowing conversion of compound (7) to compound (8). 

Figure 9 is a map illustrating the relative loci of the 48 identified genes spanning 
approximately 65 kb of continuous sequence. Eight of the genes identified show no
homologs in the public databases. 

Figure 10 depicts additional postulated routes for the biosynthesis of required
nucleotide sugars. The enzymes are depicted as follows: E deox = deoxygenase; E am =
aminotransferase; E ep = epimerase; E met = methyltransferase; E od = 4,6-dehydratase; E ox =
oxidase; E p = nucleotidyltransferase; E red = reductase; E sh = sulfhydryltransferase.

Figure 11 is a schematic showing the iodination of orsellinic acid mediated by
CalV and CalT, as well as the subsequent steps of oxidation, mediated by CalS and CalW,
and methylation, mediated by CalD and Call. Additionally, the figure shows the synthesis
of putative substrates for the reaction.

Figure 12 describes the mechanism of calicheamicin resistance in 
Micromonospora. calC confers calicheamicin resistance to bacteria. 

Figure 13 is a schematic diagram of the first continuous assay for enediyne-induced
DNA cleavage, the molecular break lights. The solid lines represent covalent bonds,
dashed lines represent hydrogen bonding, letters represent arbitrary bases, the gray shaded
ball represents the fluorophore (FAM: fluorescein), the black ball represents the
corresponding quencher (DABCYL: 4-(4'-dimethylaminophenylazo)benzoic acid) and the
dashed wedges represent fluorescence. Generally, molecular beacons operate by a
separation of the fluorophore-quencher pair resulting in a corresponding fluorescent signal.
Molecular break lights, as illustrated in the figure, operate through cleavage of the stem by
an enzymatic or non-enzymatic nuclease activity resulting in the separation of the
fluorophore-quencher pair and corresponding fluorescent signal. In this study, molecular
break lights contain either a preferred calicheamicin recognition site (bold-faced, TCCT)
or the BamHI recognition site (bold-faced, GGATCC). The predicted cleavage sites are
illustrated by arrows.

Figure 14 shows the demonstration of molecular break light specificity and general
proof of principle. The panels show the observed change in fluorescence intensity over time
of an assay containing 3.2 nM break light at 37 °C. (a) Calicheamicin MLB (break light
A) with 100 U BamHI (□), BamHI MLB (break light B) with 100 U BamHI (○) and
BamHI MLB without enzyme (•) (10 mM Tris-HCl, 50 mM NaCl, 10 mM MgCl2, 1 mM
DTT, pH 7.9; λEx = 485 nm, λEm = 517 nm). (b) Calicheamicin MLB (break light A) with
10 U DNaseI (□), BamHI MLB (break light B) with 10 U DNaseI (○) and
calicheamicin MLB (break light A) without enzyme (•) (40 mM Tris-HCl, 10 mM MgSO4,
1 mM CaCl2, pH 8.0; λEx = 485 nm, λEm = 517 nm). This is the most sensitive assay for
BamHI and DNaseI DNA cleavage activity to date.

Figure 15 shows the cleavage of calicheamicin MLB (break light A) by
calicheamicin and esperamicin. The panels show the observed DNA cleavage over time of an
assay containing 3.2 nM calicheamicin MLB at 37 °C (40 mM Tris-HCl, pH 7.5; λEx = 485 nm,
λEm = 517 nm), DTT (50 µM) and varied enediyne. (a) Calicheamicin concentrations: 31.7 nM
(○), 15.9 nM (□), 3.2 nM (◊), 1.6 nM (△), 0.78 nM (•) and 0.31 nM (■). (b) Esperamicin
concentrations: 31.7 nM (○), 15.9 nM (□), 3.2 nM (◊), 1.6 nM (△), 0.78 nM (•), 0.31 nM
(■) and 0.15 nM (♦). These results represent the first continuous and most sensitive assay
for enediyne-induced DNA cleavage.

Figure 16 (a) The observed DNA cleavage over time of an assay containing a
constant 3.2 nM break light A at 37 °C (50 mM sodium phosphate, 2.5 mM ascorbate, pH
7.5; λEx = 485 nm, λEm = 517 nm) and varied bleomycin. Bleomycin concentrations:
200 nM (○), 100 nM (□), 50 nM (◊), 25 nM (△), 12.5 nM (•), 5 nM (■) and 2.5 nM
(♦). (c) The observed DNA cleavage over time of an assay containing a constant 32 nM
break light A at 37 °C (40 mM Tris-HCl, 2.5 mM ascorbate, pH 7.5; λEx = 485 nm, λEm =
517 nm) and varied MPE. Fe(II) concentrations: 50 nM (○), 125 nM (□), 250 nM
(◊), 500 nM (△), 1 µM (•) and 2 µM (■). (d) The observed DNA cleavage over
time of an assay containing a constant 32 nM break light A at 37 °C (40 mM Tris-HCl, 2.5
mM ascorbate, pH 7.5; λEx = 485 nm, λEm = 517 nm) and varied Fe(II)-EDTA. Fe(II)
concentrations: 12.5 µM (○), 6.3 µM (□), 3.1 µM (◊), and 1.3 µM (△).

Figure 17 shows the direct in vitro inhibition of calicheamicin-mediated DNA 
cleavage using the break light assay. 3.6 pM break light A is coincubated with 3.5 nM
calicheamicin with increasing amounts of CalC. Complete inhibition of calicheamicin is 
achieved with roughly 2-fold excess of CalC. CalC has no effect on esperamicin-induced 
cleavage of DNA. 

Figure 18 shows the interaction between CalC and "activated" calicheamicin as 
measured by an increase in tryptophan fluorescence of CalC. CalC has 5 tryptophan and 
no cysteine residues and is unaffected by the reductive activator dithiothreitol (DTT). As 
the concentration of calicheamicin (3) increases in the absence of DTT there is little 
change in the CalC Trp fluorescence intensity. The addition of DTT to "activate" 
calicheamicin (4) results in increased binding to CalC as shown by the increase in CalC 
Trp fluorescence intensity. 

Detailed Description of the Invention 

The present invention is directed to the isolation and characterization of the
calicheamicin biosynthetic cluster. This cluster contains the genes that encode the proteins
and enzymes involved in deoxysugar synthesis (the aryltetrasaccharide) and
polyketide biosynthesis (the aglycone and aromatic residue of the aryltetrasaccharide) of
calicheamicin, as well as regulation, transport, cluster mobility and calicheamicin
resistance. Forty-eight putative genes have been identified, twenty-seven of which encode
putative structural proteins with the remainder encoding a variety of functions.
Specifically, there are 15 genes that encode for the aryltetrasaccharide moiety (20,928 bp;
D, E, F, G, H, J, K, N, O, Q, S, T, U, X, W, 6MSAS), 12 putative genes which encode for
the aglycone (13,284 bp; P, S, V, W, ActI, ActII, ActIII, OrfI, OrfIII, OrfV, OrfVI, OrfVII),
13 putative genes involved in membrane transport, regulation, DNA movement and/or
resistance (19,704 bp; A, B, C, I, L, M, R, orf4, orf8, OrfVIII, OrfIX, OrfX, OrfXI, IS-
element), and the remaining 8 genes of unknown function (7383 bp; orf1, orf2, orf3, orf5,
orf6, orf7, OrfII, OrfIV).
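
The tally above can be checked with a few lines of arithmetic. The following Python sketch only restates the per-class gene counts and combined lengths given in the text; the dictionary keys are illustrative labels chosen here, not names used in the disclosure.

```python
# Gene counts and combined lengths (bp) per functional class, as stated in the text.
# The class labels are illustrative shorthand.
classes = {
    "aryltetrasaccharide": (15, 20_928),
    "aglycone": (12, 13_284),
    "transport/regulation/mobility/resistance": (13, 19_704),
    "unknown function": (8, 7_383),
}

total_genes = sum(count for count, _ in classes.values())
total_bp = sum(length for _, length in classes.values())

print(f"{total_genes} genes, {total_bp} bp in total")
```

Under these figures the four classes account for all 48 putative genes and roughly 61.3 kb of sequence.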

The calicheamicin biosynthetic gene cluster comprises the following genes: calA,
calB, calC, calD, calE, calF, calG, calH, calI, calJ, calK, calL, calM, calN, calO, calP,
calQ, calR, calS, calT, calU, calV, calW, calX, 6MSAS, ActI, ActII, ActIII, orf1, orf2,
orf3, orf4, orf5, orf6, orf7, orf8, orfI, orfII, orfIII, orfIV, orfV, orfVI, orfVII, orfVIII, orfIX,
orfX, orfXI and an IS-element gene. It should be noted that orf1-8 may contain DNA
derived in whole or in part from recombinant vectors LP46 and/or LP54. The above listed
genes encode the following polypeptides: CalA (328 amino acids), CalB (561 amino
acids), CalC (181 amino acids), CalD (263 amino acids), CalE (420 amino acids), CalF
(245 amino acids), CalG (990 amino acids), CalH (338 amino acids), CalI (568 amino
acids), CalJ (332 amino acids), CalK (440 amino acids), CalL (562 amino acids), CalM
(416 amino acids), CalN (398 amino acids), CalO (331 amino acids), CalP (approximately
179 amino acids), CalQ (453 amino acids), CalR (265 amino acids), CalS (1113 amino
acids), CalT (280 amino acids), CalU (377 amino acids), CalV (125 amino acids), CalW
(449 amino acids), CalX (197 amino acids), 6MSAS (198 amino acids), ActI (207 amino
acids), ActII (136 amino acids), ActIII (308 amino acids), Orf1 (322 amino acids), Orf2
(654 amino acids), Orf3 (209 amino acids), Orf4 (521 amino acids), Orf5 (175 amino
acids), Orf6 (139 amino acids), Orf7 (187 amino acids), Orf8 (266 amino acids), OrfI (127
amino acids), OrfII (248 amino acids), OrfIII (298 amino acids), OrfIV (363 amino acids),
OrfV (288 amino acids), OrfVI (1012 amino acids), OrfVII (236 amino acids), OrfVIII
(441 amino acids), OrfIX (504 amino acids), OrfX (504 amino acids), OrfXI (251 amino
acids) and IS-element (402 amino acids).

In elucidating the calicheamicin biosynthetic gene cluster, the inventors began with 
a genomic library containing the genome of Micromonospora echinospora spp. 
calichensis. The cosmid library was generated by isolating chromosomal DNA of 
Micromonospora echinospora spp. calichensis, fragmenting that chromosomal DNA, 
inserting the DNA into a cosmid vector and generating a cosmid library according to 
methods well known in the art. This procedure can be performed using any species of 
Micromonospora, Streptomyces, or other suitable bacteria. 

Based upon prior enediyne metabolic labeling studies it was postulated that the 
calicheamicin aglycone would be polyketide derived. Polyketide metabolites encompass a 
vast variety of structural diversities yet share a common mechanism of biosynthesis. 
Hutchinson, C.R., et al., Chem. Rev., 97, 2525-2535 (1997); Strohl, W.R., et al.,
Biotechnology of Antibiotics pp. 577-657; Fujii, I., et al., Chem. Rev., 97, 2511-2523
(1997); Hopwood, D.A., et al., Chem. Rev., 97, 2465-2497 (1997); Hopwood, D.A., et al.,
Ann. Rev. Genet., 24, 37-66 (1990); Staunton, J., et al., Chem. Rev., 97, 2611-2629
(1997). Most importantly, polyketide synthase ("PKS") genes display a high degree of
sequence homology (from pathway to pathway and organism to organism) and are often
clustered with genes encoding self-resistance and deoxysugar ligand biosynthesis.
Hopwood, D.A., et al., Chem. Rev., 97, 2465-2497 (1997); Hopwood, D.A., et al., Ann.
Rev. Genet., 24, 37-66 (1990); Staunton, J., et al., Chem. Rev., 97, 2611-2629 (1997).

Degenerate primers based upon conserved regions within PKS genes were used in
Southern hybridizations to identify clones from the M. echinospora genomic library that
carried putative PKS genes. The Southern hybridizations were performed by methods
known in the art. Southern hybridization of the genomic M. echinospora cosmid library
with a DNA probe designed to target type I PKS genes (KS1) (Kakavas, S.J., et al., J.
Bacteriol., 179, 7515-7522 (1997)) revealed five positive clones, which were designated
clones 4b, 10a, 13a, 56, and 60. See Figure 1. The same five clones were also identified
upon rescreening the genomic library with a type II PKS DNA probe (actI). See Figure 1.
Although this preliminary analysis clearly demonstrated the presence of Micromonospora 
PKS gene homologues, a secondary screen was performed, as PKS hybridization analyses 
are often plagued by false hybridization to gene clusters that encode spore pigment 
biosynthesis. 
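
A degenerate primer is a pool of related oligonucleotides written with IUPAC ambiguity codes. As an illustration only, the Python sketch below expands a degenerate sequence into its individual members. The sequence GGSTAY is hypothetical and is not one of the probes used in this work.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}

def expand(primer):
    """Return every unambiguous sequence encoded by a degenerate primer."""
    return ["".join(bases) for bases in product(*(IUPAC[b] for b in primer))]

# Hypothetical degenerate sequence: S = C or G, Y = C or T.
members = expand("GGSTAY")
print(len(members), members)
```

The number of members grows multiplicatively with each ambiguous position, which is why conserved regions with few variable positions are preferred when such primers are designed.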

The second screening was based on the assumption that calicheamicin's
biosynthetic cluster would also contain genes encoding for deoxysugar ligand synthesis. 
Further, it was postulated that all hexopyranosyl ligands of calicheamicin diverged from 
the common intermediate 4-keto-6-deoxy TDP-D-glucose (30), Figure 5, as 
macromolecule-sugar synthesis in many organisms began with a similar common 
intermediate. Thus, it was believed that the cluster encoding for calicheamicin 
biosynthesis, in addition to carrying a PKS-encoding region, would carry both a common 
glucose-1-phosphate nucleotidyltransferase and an NDP-α-D-glucose 4,6-dehydratase gene,
encoding the putative enzymes E p and E od, respectively. See figure 5. These enzymes are
necessary to convert a sugar (12) (figure 5) to the hypothesized common intermediate, 4-
keto-6-deoxy TDP-D-glucose (30). Analogs to 4,6-dehydratases have been previously
characterized from E. coli, Salmonella, and Streptomyces. Additionally, a nucleotide
transferase from Salmonella has been characterized as an alpha-D-glucose-1-phosphate
thymidylyltransferase. The secondary screen was performed using a probe based upon the
postulation that M. echinospora's calicheamicin synthesis would begin from a similar
precursor found in E. coli, Streptomyces and Salmonella, and that this precursor required a 
dehydratase to convert it into the common intermediate, 4-keto-6-deoxy TDP-D-glucose 
(30). In particular, a DNA probe (designated E od r ) was designed from the conserved 
NAD+-binding site of bacterial NDP-α-D-glucose 4,6-dehydratases. He, X., et al.,
Biochem., 35, 4721-4731 (1996). Southern hybridization of the genomic M. echinospora
cosmid library with the E od T probe revealed cross-hybridization with clones 4b, 10a, 13a, 
56, and 60. Two additional clones, designated 58 and 66, were also identified in this 
screen. See Figure 1. This secondary hybridization indicated the clustering of genes 
encoding both polyketide and deoxysugar biosynthesis. 

For final corroboration, since secondary metabolite biosynthesis is typically 
clustered with resistance genes in actinomycetes, all hybridization-positive clones were 
tested for their ability to grow in the presence of varying concentrations of calicheamicin. 
In this final screen, six of the seven hybridizing clones displayed differing levels of
resistance to calicheamicin (4b ≈ 10a ≈ 13a > 56 > 66 > 60) (see Figure 1), while clone 58 lacked
the ability to grow in the presence of calicheamicin. In addition, these resistance screens
revealed that clones 4b, 10a, and 13a conferred much higher levels of resistance to
calicheamicin than the other clones. Upon rescreening the genomic library for
calicheamicin-resistant clones, three additional clones (3a, 4a, and 16a) were found to
confer similar levels of resistance. Cumulatively, the results demonstrated that clones 4b,
10a, 13a, 56, and 60 carried PKS I and II homologues and deoxysugar biosynthetic genes,
as well as the gene responsible for conferring calicheamicin self-resistance.
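
The three screens can be read as successive set intersections. The short Python sketch below restates the clone designations reported above; the variable names are illustrative only.

```python
# Clones scoring positive in each screen, as reported in the text.
pks_positive = {"4b", "10a", "13a", "56", "60"}        # type I and type II PKS probes
dehydratase_positive = pks_positive | {"58", "66"}     # 4,6-dehydratase probe
resistant = {"4b", "10a", "13a", "56", "66", "60"}     # grew in the presence of calicheamicin

# Clones positive in all three screens.
candidates = pks_positive & dehydratase_positive & resistant
print(sorted(candidates))
```

Only the clones that pass all three screens are carried forward, which matches the five clones named in the cumulative result.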

The clones positive for PKS I and II and deoxysugar biosynthesis homology and
calicheamicin resistance were used to map the biosynthetic cluster. Southern hybridization 
established similarity between clones 3a, 4a, 4b, 10a, 13a, 16a and 56. In addition, 
nucleotide sequence overlaps were found between clones 4b, 13a, and 56. See Figure 1. 
Restriction mapping and Southern hybridization of these clones indicated that the positive 
cosmid clones corresponded to a continuous region of the M. echinospora chromosome 
spanning > 100 kb. The present invention thus provides for cosmids having a nucleic acid 
molecule from Micromonospora echinospora encoding for a nonchromoprotein enediyne 
biosynthetic cluster. 

After isolating the biosynthetic gene cluster and elucidating the sequence, open 
reading frames ("orfs") were assigned. Tentative gene assignments were derived from 
amino acid sequence similarity of translated orfs to gene products of known function via 
direct BLAST (Basic Local Alignment Search Tool) database searches on the amino acid 
level. Karlin, et al., Proc. Natl. Acad. Sci. U.S.A., 87, 2264-2268 (1990); Karlin, et al.,
Proc. Natl. Acad. Sci. U.S.A., 90, 5873-5877 (1993); Altschul, Nature Genet., 6, 119-
129 (1994). The gene cluster organization is provided in figure 1.
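
Assigning open reading frames starts from a scan for start-to-stop stretches in all six reading frames. A minimal Python sketch of such a scan is given below. It is a simplified illustration and not the procedure used by the inventors; the choice of ATG and GTG as start codons, the minimum length, and the test sequence are all assumptions.

```python
START_CODONS = {"ATG", "GTG"}          # assumption: GTG allowed as an alternative start
STOP_CODONS = {"TAA", "TAG", "TGA"}

def reverse_complement(seq):
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp[b] for b in reversed(seq))

def find_orfs(seq, min_codons=100):
    """Naive six-frame ORF scan.

    Returns (strand, start, end, codons) tuples. Coordinates are 0-based,
    end-exclusive, and for the "-" strand refer to the reverse complement.
    `codons` counts the start codon but not the stop codon.
    """
    orfs = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon in START_CODONS:
                    start = i                      # first start codon opens the ORF
                elif start is not None and codon in STOP_CODONS:
                    n_codons = (i - start) // 3
                    if n_codons >= min_codons:
                        orfs.append((strand, start, i + 3, n_codons))
                    start = None                   # look for the next ORF in this frame
    return orfs

# Made-up 19-nt sequence with one short ORF, for demonstration only.
demo_orfs = find_orfs("CCATGAAACCCGGGTAACC", min_codons=3)
print(demo_orfs)
```

Each translated ORF found this way would then be compared against a protein database, as described above, before any function is proposed.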

Based on BLAST analysis, tentative gene assignments were made. Specifically,
there are 15 genes that encode for the aryltetrasaccharide moiety (20,928 bp; D, E, F, G, H,
J, K, N, O, Q, S, T, U, X, W, 6MSAS), 12 putative genes which encode for the aglycone
(13,284 bp; P, S, V, W, ActI, ActII, ActIII, OrfI, OrfIII, OrfV, OrfVI, OrfVII), 13 putative
genes involved in membrane transport, regulation, DNA movement and/or resistance
(19,704 bp; A, B, C, I, L, M, R, orf4, orf8, OrfVIII, OrfIX, OrfX, OrfXI, IS-element), and
the remaining 8 genes of unknown function (7383 bp; orf1, orf2, orf3, orf5, orf6, orf7,
OrfII, OrfIV).

One aspect of the invention relates to transformation of a host cell with M.
echinospora DNA. This method provides a reproducible transformation efficiency of ~10^3
kanamycin-resistant transformants/µg DNA using a pKC1139-based vector. The invention
further provides that the host cell can be but is not limited to bacteria, yeast, fungus, insect, 
plant or mammalian. Transformations of bacteria, yeast, fungus, insect, plant or 
mammalian cells are performed by methods known in the art. 

The present invention also provides the isolation and characterization of genes
encoding polypeptides involved in calicheamicin resistance such as orfXI and calC. One
aspect of the invention relates to an isolated DNA strand having the gene calC and having
the DNA sequence SEQ ID NO: 1. The present invention also relates to an isolated
protein CalC, having the amino acid sequence SEQ ID NO: 2. The invention further
provides for calC gene fragments coding for a bioactive CalC polypeptide. The 
polypeptide, CalC, confers calicheamicin resistance and has 181 amino acids. The 
invention also provides for CalC fragments conferring calicheamicin resistance. 

The calC locus was isolated by identifying genomic cosmid clones
that were able to grow on Luria-Bertani ("LB") agar plates containing ampicillin and
calicheamicin. The DNA of the positive clones (clones that grew on the plates containing
calicheamicin) was isolated, and subsequent restriction mapping localized the desired
phenotype (calicheamicin resistance). The DNA was then sequenced and the open reading
frames analyzed to ascertain the orf encoding for the desired phenotype. In vitro studies
were also performed and confirmed the ability of CalC to inhibit DNA cleavage.

DNA containing calC was cloned into an inducible vector, using known methods, 
resulting in overexpression of calC. The polypeptide product (CalC) was then isolated and 
purified to homogeneity. Analysis of the purified CalC revealed that it is a non-heme iron 
metalloprotein that functions via inhibition of calicheamicin-induced DNA cleavage in 
vitro. Another aspect of the invention is an expression vector containing calC or a 
fragment of calC encoding for a bioactive molecule. There is also provided a transformed 
host cell, preferably bacteria, more preferably E. coli, containing calC or a fragment of 
calC encoding for a bioactive molecule. Such transgenic expression of calC results in a
10^5-fold increase in calicheamicin resistance in E. coli, a 100-fold increase in resistance in
S. lividans, and a 50-fold increase in resistance in yeast.

The present invention provides for the transformation of human cells with the calC 
gene. The transgenic expression of calC in the HT1080 (human) cell line increased its 
resistance to calicheamicin 10-fold. This technique allows bone marrow cells, for 
example, to be removed from a patient being treated with calicheamicin, and for these cells 
to be transformed with calC, and for the transformed cells to be returned to the patient. 
This allows the patient to tolerate treatment with calicheamicin or allows the patient to 
receive higher doses of calicheamicin as the returned human calC-transformed cells have
calicheamicin resistance. The transformation is performed by methods known in the art. 
This embodiment of the invention would be applicable to many diseases being treated with
calicheamicin.

The invention further provides for a method of assaying calicheamicin-induced
DNA cleavage and its CalC-mediated inhibition using the molecular break light assay.
Two molecular break lights (MLBs) for the experiments are described in example 7.
Break light A comprises a 10-base-pair stem containing the known
calicheamicin recognition sequence 5'-TCCT-3', while break light B carries the BamHI
endonuclease recognition sequence 5'-GGATCC-3'. The 5'-fluorophore of both probes
was fluorescein (FAM, absorbance max = 485 nm, emission max = 517 nm) while the
corresponding 3'-quencher was 4-(4'-dimethylaminophenylazo)benzoic acid (DABCYL).
Generally, MLBs operate by a separation of the fluorophore-quencher pair resulting in a
corresponding fluorescent signal. The molecular break lights, as illustrated in figure 13,
operate through cleavage of the stem by specific enzymatic or non-enzymatic nuclease
activity resulting in the separation of the fluorophore-quencher pair and corresponding
fluorescent signal (see figure 14). CalC, in a two-fold molar excess over calicheamicin,
completely abolishes calicheamicin-mediated DNA cleavage as monitored by the break
light assay (see figure 15).
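
The specificity of each break light rests on which recognition sequence its stem carries. The Python sketch below locates a recognition site on either strand of a stem. The two 10-nt stem sequences are hypothetical placeholders chosen to contain TCCT or GGATCC; the actual probe sequences are those of example 7 and are not reproduced here.

```python
def reverse_complement(seq):
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp[b] for b in reversed(seq))

def find_sites(stem, site):
    """Return (strand, position) hits for `site` in a stem given 5'->3' as its top strand.

    A "bottom" hit means the site lies on the complementary strand; its position
    is reported in top-strand coordinates (0-based).
    """
    hits = []
    for label, pattern in (("top", site), ("bottom", reverse_complement(site))):
        pos = stem.find(pattern)
        while pos != -1:
            hits.append((label, pos))
            pos = stem.find(pattern, pos + 1)
    return hits

# Hypothetical stems, for illustration only.
stem_a = "GCATCCTGAC"   # contains the calicheamicin site TCCT
stem_b = "ACGGATCCGT"   # contains the BamHI site GGATCC

hits_a = find_sites(stem_a, "TCCT")
hits_b = find_sites(stem_b, "GGATCC")
print(hits_a, hits_b)
```

Because GGATCC is palindromic it is found on both strands at the same position, whereas TCCT is found on one strand only; neither hypothetical stem contains the other probe's site.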

CalC acts as a "cleavage sink". In essence the protein is cleaved as an alternative 
to the desired DNA target. Thus, the invention provides the first such demonstrated 
mechanism for resistance to a cleavage agent and explains why CalC is able to function in 
all organisms tested so far (i.e., E. coli, S. lividans, yeast, and human cells).

The invention further provides for the use of the break light assay to determine
calicheamicin titers during production thereof. Furthermore, the molecular break light
assay may be used to determine the DNA cleavage activity of calicheamicin analogs
generated using the techniques of this invention.

Another aspect of the invention relates to an isolated DNA strand containing the
calH gene having the DNA sequence SEQ ID NO: 3. The invention also relates to the
polypeptide CalH, having amino acid sequence SEQ ID NO: 4. The invention further
provides for calH gene fragments coding for a bioactive CalH. CalH is involved in the
formation of the aryltetrasaccharide 4,6-dideoxy-4-hydroxylamino-D-glucose moiety.
CalH catalyzes the conversion of intermediate (30) to intermediate (39) (figure 5). CalH is
a TDP-6-deoxy-D-glycero-L-threo-4-hexulose 4-transaminase, which catalyzes a pyridoxal
phosphate ("PLP")-dependent transamination from glutamate to provide 4-amino-6-deoxy
TDP-D-glucose (intermediate 39) (figure 5). The invention also provides for CalH
fragments that retain bioactivity. There is also provided an expression vector containing the
calH gene or fragments of the calH gene that encode for a bioactive polypeptide. CalH
was overexpressed as a (histidine)10-fusion protein and subsequently purified by nickel
affinity chromatography.

According to BLAST analysis, CalH closely resembles perosamine synthase, an
enzyme which converts compound 30 to compound 39 (see figure 5) en route to the
biosynthesis of TDP-perosamine (TDP-4,6-dideoxy-4-amino-D-mannose) in E. coli.
Wang, L., et al., Infect. Immun., 66, 3545-3551 (1998). Thus CalH is believed to be a 4-
ketohexose aminotransferase. To confirm the tentative BLAST-assigned function, a
combinatorial biosynthesis was performed. Specifically, the calH gene from calicheamicin
was incorporated into a mutant strain of Streptomyces venezuelae. The 4-dehydrase gene
(desI) in the methymycin/pikromycin pathway was deleted in this mutant strain. A
promoter sequence from the S. venezuelae methymycin/pikromycin cluster was
incorporated in the expression vector to drive the expression of foreign genes (the calH of
calicheamicin) in S. venezuelae. In wild-type S. venezuelae, the methymycin/pikromycin
pathway is known to produce methymycin, neomethymycin, pikromycin, and narbomycin.
See figure 6. Deletion of the desI gene in the mutant strain led to the accumulation of the
CalH substrate, TDP-4-keto-6-deoxyglucose (compound 30, figure 6). The constructed
expression vector with the S. venezuelae promoter expressed the calH gene to make the
CalH protein. CalH acted on the substrate, 30, to produce compound 39 (figure 6).
Compound 39 in turn, with the action of S. venezuelae's DesVII (a glycosyltransferase),
produced two methymycin/pikromycin-calicheamicin hybrid compounds. See Figure 6,
compounds 40 and 41. These hybrid compounds carry the 4-aminohexose ligand of
calicheamicin. This work provides indisputable support for the calH gene assignment as
encoding the TDP-6-deoxy-D-glycero-L-threo-4-hexulose 4-aminotransferase of the
calicheamicin pathway. CalH acted on the TDP-4-keto-6-deoxyglucose substrate
(compound 30) to produce compound 39 (Figure 5).

Moreover, CalH is able to directly mediate the synthesis of the product TDP-4-amino-4,6-
dideoxy-alpha-D-glucose, as demonstrated by HPLC isolation of the product and
confirmation by high-resolution mass spectrometry. In addition, this compound was found
to co-elute with chemically synthesized TDP-4-amino-4,6-dideoxy-alpha-D-glucose.

In addition, these results reinforce the indiscriminate nature of the corresponding 
glycosyltransferase (DesVII) as they reveal that the glycosyltransferase (DesVII) of the S.
venezuelae pathway can recognize alternative sugar substrates whose structures are
considerably different from the original amino sugar substrate, TDP-D-desosamine. The
results also clearly demonstrate the ability to engineer secondary metabolite glycosylation
through a rational selection of gene combinations. The successful expression of the CalH
protein in S. venezuelae by the newly constructed expression vector highlights the potential
of using this system to express other foreign genes in this strain.

Thus, one aspect of the present invention further relates to the construction of a 
composite gene cluster having the ability to make and attach non-natural sugars. The 
invention further provides an expression vector having a calicheamicin gene operably 
linked to regulatory sequences to control expression of the calicheamicin protein, and 
preferably the regulatory sequence is a Streptomyces promoter. The present invention also 
relates to two newly synthesized sugars, compound (11) and compound (12) (figure 7).
Compound 11 has the formula:

[Structure of compound 11]

The spectral data of compound 11 were as follows:

1H NMR (500 MHz, CDCl3, J in hertz) δ 6.75 (1H, dd, J = 16.0, 5.5, 9-H), 6.44 (1H,
dd, J = 16.0, 1.2, 8-H), 5.34 (1H, d, J = 8.0, N-H), 4.96 (1H, m, 11-H), 4.27 (1H, d, J = 7.5,
1'-H), 3.66 (1H, dd, J = 9.5, 8.0, 4'-H), 3.60 (1H, d, J = 10.5, 3-H), 3.50 (1H, t, J = 9.5,
3'-H), 3. d (1H, m, 5'-H), 3.4 (1H, m, 2'-H), 2.84 (1H, dq, J = 10.5, 7.5, 2-H), 2.64 (1H, m,
10-H), 2.53 (1H, m, 6-H), 2.06 (3H, s, Me-C=O), 1.7 (1H, m, 12-H), 1.66 (1H, m, 5-H),
1.56 (1H, m, 12-H), 1.4 (1H, m, 5-H), 1.36 (3H, d, J = 7.5, 2-Me), 1.25 (3H, d, J = 6.5, 5'-
Me), 1.24 (1H, m, 4-H), 1.21 (3H, d, J = 7.5, 6-Me), 1.10 (3H, d, J = 6.5, 10-Me), 0.99 (3H,
d, J = 6.0, 4-Me), 0.91 (3H, t, J = 7.2, 12-Me); 13C NMR (125 MHz, CDCl3) δ 205.3 (C-7),
175.1 (C-1), 171.9 (Me-C=O), 147.1 (C-9), 126.1 (C-8), 103.0 (C-1'), 85.8 (C-3), 75.8 (C-
5'), 75.8 (C-3'), 74.1 (C-11), 70.8 (C-2'), 57.6 (C-4'), 45.3 (C-6), 44.0 (C-2), 38.1 (C-10),
34.2 (C-5), 33.6 (C-4), 25.4 (C-12), 23.7 (Me-C=O), 18.1 (C-6'), 17.9 (6-Me), 17.6 (4-Me),
16.4 (2-Me), 10.5 (12-Me), 9.8 (10-Me). High-resolution FAB-MS calculated for
C25H42NO8 (M + H+) 484.2910, found 484.2303.

Compound 12 has the formula:

[Structure of compound 12]

The spectral data of compound 12 were as follows:

1H NMR (500 MHz, CDCl3, J in hertz) δ 6.69 (1H, dd, J = 16.0, 6.0, 11-H), 6.09
(1H, dd, J = 16.0, 1.5, 10-H), 5.35 (1H, d, J = 8.5, N-H), 4.96 (1H, m, 13-H), 4.36 (1H, d,
J = 7.5, 1'-H), 4.19 (1H, m, 5-H), 3.83 (1H, q, J = 6.5, 2-H), 3.68 (1H, dt, J = 10.0, 8.5, 4'-H),
3.52 (1H, t, J = 8.5, 3'-H), 3.50 (1H, m, 5-H), 3.42 (1H, t, J = 7.5, 2'-H), 2.92 (1H, dq, J =
7.0, 5.0, 4-H), 2.81 (1H, m, 8-H), 2.73 (1H, t, J = 7.5, 2'-H), 2.06 (3H, s, Me-C=O), 1.8 (1H,
m, 6-H), 1.6 (1H, m, 14-H), 1.55 (1H, m, 7-H), 1.37 (3H, d, J = 6.5, 2-Me), 1.32 (3H, d,
J = 7.0, 4-Me), 1.3 (1H, m, 14-H), 1.27 (3H, d, J = 6.5, 5'-Me), 1.25 (1H, m, 7-H), 1.12
(3H, d, J = 6.0, 8-Me), 1.11 (3H, d, J = 6.5, 12-Me), 1.07 (3H, d, J = 6.0, 6-Me), 0.91 (3H,
t, J = 7.2, 14-Me); high-resolution FAB-MS calculated for C28H46NO9 (M + H+)
540.3172, found 540.3203.
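
The calculated masses quoted for compounds 11 and 12 can be cross-checked from elemental compositions. The Python sketch below reads the (M + H+) compositions as C25H42NO8 and C28H46NO9, which is an assumption made here from the printed formulas, and sums standard monoisotopic atomic masses. The electron mass is not subtracted, a simplification that reproduces the stated calculated values to within 0.001.

```python
# Monoisotopic atomic masses (u) of the most abundant isotopes.
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "N": 14.0030740, "O": 15.9949146}

def formula_mass(counts):
    """Sum of monoisotopic atomic masses for an elemental composition."""
    return sum(MONOISOTOPIC[element] * n for element, n in counts.items())

# Assumed (M + H+) compositions for compounds 11 and 12.
mass_11 = formula_mass({"C": 25, "H": 42, "N": 1, "O": 8})
mass_12 = formula_mass({"C": 28, "H": 46, "N": 1, "O": 9})

print(f"compound 11: {mass_11:.4f}")
print(f"compound 12: {mass_12:.4f}")
```

Agreement between the summed masses and the quoted calculated values supports this reading of the two elemental compositions.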

One aspect of the invention relates to an isolated DNA strand containing the calG 
gene and having the DNA sequence SEQ ID. NO.: 5. Another aspect of the invention is 
the protein, CalG, having amino acid sequence SEQ ID. No.: 6. According to BLAST 
analysis, calG encodes a 4,6-dehydratase. Dehydratases had been characterized from E. 
coli, Salmonella and Streptomyces, (Thompson, M. et al., J. Gen. Microbiol, 138, 779-786 
(1992); Vara, J.A., et al., J. Biol Chem., 263, 14992-14995 (1988)), and analogous NDP- 
D-glucose 4,6-dehydratases had been characterized from a variety of organisms. Liu, H.-w., 
et al., Ann. Rev. Microbiol., 48, 223-256 (1994); Hallis, T.M., et al., Acc. Chem. Res., 
in press (1999). Based upon these prior studies, it was known that the overall 
transformation catalyzed by 4,6-dehydratases is an intramolecular oxidation-reduction 
where an enzyme-bound NAD + receives the 4-H as a hydride in the oxidative half-reaction 
and passes the reducing equivalents to C-6 of the dehydration product in the reductive 
half-reaction. Thus, it appears that CalG is necessary for the formation of the 
aryltetrasaccharide 4,6-dideoxy-4-hydroxylamino-D-glucose moiety. CalG appears to be a 
TDP-D-glucose 4,6-dehydratase which catalyzes the conversion of intermediate 13 into 
intermediate 30. (See figure 5). Another aspect of the invention is an expression vector 
containing calG or a fragment of calG encoding for a bioactive molecule. There is also 
provided a transformed host cell, preferably bacteria, more preferably E. coli, containing 
calG or a fragment of calG encoding for a bioactive molecule. 

Moreover, CalG is able to directly mediate the synthesis of the product 
TDP-4-keto-6-deoxy-alpha-D-glucose, as demonstrated by an assay wherein the product is known 
to absorb at 320 nm under basic conditions. In addition, this compound was found to 
co-elute with chemically synthesized TDP-4-keto-6-deoxy-alpha-D-glucose. CalG has been 
demonstrated to utilize UDP-glucose as a substrate. 

There is also disclosed an isolated DNA strand containing the calS gene. Based on 
sequence homology with other P450-oxidases, CalS appears to be a P450-oxidase 
homolog which performs the oxidation of intermediate 39 to intermediate 42 (figure 5). 
The oxidation may occur at the nucleotide sugar level, or hydroxylamine formation may occur 
after the sugar has been transferred to the aglycone. There is also provided an expression vector 
containing the calS gene or a fragment of calS encoding for a bioactive molecule. There is 
also provided a transformed host cell, preferably bacteria, more preferably E. coli, 
containing calS or a fragment of calS encoding for a bioactive molecule. 

There is also disclosed an isolated DNA strand containing the calQ gene. Based on 
sequence homology, CalQ appears to be a UDP-D-glucose 6-dehydrogenase homolog. The 
CalQ assay is based upon the requirement of this enzyme for two equivalents of NAD+ for 
activity. Thus, the assay follows the increase in absorbance that results from the conversion 
of NAD+ to NADH upon the conversion of UDP-alpha-D-glucose to UDP-alpha-D-glucuronic 
acid. The product was also shown to co-elute with commercially available UDP-glucuronic 
acid and was separately confirmed by high resolution mass spectrometry. This enzyme 
was also shown to utilize TDP-glucose. 
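
The stoichiometry behind the CalQ assay can be illustrated with a short calculation. The sketch below assumes that NADH formation is followed at 340 nm, with the commonly cited NADH molar absorptivity of 6220 M⁻¹ cm⁻¹ and a 1 cm path length; neither the wavelength nor these values are stated in this disclosure, and the function name is purely illustrative.

```python
# Assumed literature value for NADH at 340 nm (not taken from this disclosure).
NADH_EXTINCTION_340 = 6220.0  # M^-1 cm^-1
# Two NAD+ are reduced per UDP-glucuronic acid formed (stated in the text).
NADH_PER_PRODUCT = 2


def product_formed_molar(delta_absorbance, path_cm=1.0):
    """Convert an absorbance increase into molar UDP-glucuronic acid formed."""
    if path_cm <= 0:
        raise ValueError("path length must be positive")
    # Beer-Lambert law: A = epsilon * c * l, solved for the NADH concentration.
    nadh_molar = delta_absorbance / (NADH_EXTINCTION_340 * path_cm)
    return nadh_molar / NADH_PER_PRODUCT


if __name__ == "__main__":
    # Hypothetical absorbance change, for illustration only.
    print(f"{product_formed_molar(0.622) * 1e6:.1f} uM product")
```

Dividing by two reflects the two-equivalent NAD+ requirement; an assay read at a different wavelength would need its own absorptivity.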

There is also provided an expression vector containing the calQ gene or a fragment 
of calQ encoding for a bioactive molecule. There is also provided a transformed host cell, 
preferably bacteria, more preferably E. coli, containing calQ or a fragment of calQ 
encoding for a bioactive molecule. 

The present invention allows genetic manipulation of the biosynthetic gene cluster 
to produce calicheamicin analogs. The present invention provides for producing 
calicheamicin analogs by constructing deletions or substitutions of the genes involved in 
biosynthesis of the aryltetrasaccharide. The invention further provides for in vitro 
glycosylation by altering the glycosylation pattern of calicheamicin (via a 
glycosyltransferase) to produce additional analogs. The invention also provides for 
alteration of the calicheamicin aglycone by genetic manipulation of the genes encoding the 
biosynthesis of the warhead. Genetic manipulations, such as producing deletions or 
substitutions, are performed using methods known in the art. 

The invention provides for a method of purifying calicheamicin through affinity 
chromatography. Because of its homology with calicheamicin, CalC functions as a 
calicheamicin-sequestering/binding protein. Affinity chromatography is performed using 
methods known in the art. 

The invention relates to the expression of the genes located in the biosynthetic gene 
cluster by using methods known in the art to insert the genes into a suitable expression 
vector and operably linking the gene to regulatory sequences to control expression of the 
gene to produce the protein encoded by the inserted gene. The present invention also 
provides for expression of biologically active proteins by inserting fragments of genes 
selected from the biosynthetic gene cluster, which encode for biologically active proteins, 
into a suitable expression vector, using methods known in the art. The genes would be 
operably linked to regulatory sequences to control their expression. 

The term "hybridization" as used herein is generally used to mean hybridization of 
nucleic acids at appropriate conditions of stringency as would be readily evident to those 
skilled in the art depending upon the nature of the probe sequence and target sequences. 
Conditions of hybridization and washing are well known in the art, and the adjustment of 
conditions depending upon the desired stringency by varying incubation time, temperature 
and/or ionic strength of the solution are readily accomplished. See, for example, 
Sambrook, J. et al., Molecular Cloning: A Laboratory Manual, 2nd edition, Cold Spring 
Harbor Press, Cold Spring Harbor, New York, 1989. The choice of conditions is dictated 
by the length of the sequences being hybridized, in particular, the length of the probe 
sequence, the relative G-C content of the nucleic acids and the number of mismatches to be 
permitted. Low stringency conditions are preferred when partial hybridization between 
strands that have lesser degrees of complementarity is desired. When perfect or near 
perfect complementarity is desired, high stringency conditions are preferred. For typical 
high stringency conditions, the hybridization solution contains 6x SSC, 0.01 M EDTA, 
1x Denhardt's solution and 0.5% SDS. Hybridization is carried out at about 68°C for about 
3 to 4 hours for fragments of cloned DNA and for about 12 to about 16 hours for total 
eukaryotic DNA. For lower stringencies the temperature of hybridization is reduced to 
about 12°C below the melting temperature (Tm) of the duplex. The Tm is known to be a 
function of the G-C content and duplex length as well as the ionic strength of the solution. 
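
The dependence of the melting temperature on G-C content, duplex length and ionic strength can be made concrete with a small sketch. The formula used below is a widely used empirical approximation for long DNA duplexes; it is an assumption introduced here for illustration and is not a formula given in this disclosure. The function names and the example inputs are hypothetical.

```python
import math


def melting_temperature_c(gc_percent, length_bp, na_molar):
    """Estimate duplex Tm in degrees C from %GC, length (bp) and [Na+] (M).

    Uses the empirical approximation
    Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(%GC) - 675/N,
    which is assumed here and applies only to long duplexes.
    """
    if not 0 <= gc_percent <= 100:
        raise ValueError("gc_percent must be between 0 and 100")
    if length_bp <= 0 or na_molar <= 0:
        raise ValueError("length and salt concentration must be positive")
    return 81.5 + 16.6 * math.log10(na_molar) + 0.41 * gc_percent - 675.0 / length_bp


def low_stringency_temperature_c(tm_c, offset_c=12.0):
    """Hybridization temperature set a fixed offset below Tm (about 12 C in the text)."""
    return tm_c - offset_c


if __name__ == "__main__":
    tm = melting_temperature_c(gc_percent=50.0, length_bp=500, na_molar=1.0)
    print(f"Tm = {tm:.2f} C, low-stringency hybridization near {low_stringency_temperature_c(tm):.2f} C")
```

For short oligonucleotide probes a different estimate would be more appropriate; the sketch is only meant to show how the three variables named above enter the calculation.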

As used herein, the term "substantial sequence identity" or "substantial homology" 
is used to indicate that a nucleotide sequence or an amino acid sequence exhibits 
substantial structural or functional equivalence with another nucleotide or amino acid 
sequence. Any structural or functional differences between sequences having substantial 
sequence identity or substantial homology will be de minimis; that is, they will not 
substantially affect the ability of the sequence to function as indicated in the desired 
application. Differences may be due to inherent variations in codon usage among different 
species, for example. Structural differences are considered de minimis if there is a 
significant amount of sequence overlap or similarity between two or more different 
sequences or if the different sequences exhibit similar physical characteristics even if the 
sequences differ in length or structure. Such characteristics include for example, ability to 
hybridize under defined conditions, or in the case of proteins, immunological 
crossreactivity, similar enzymatic activity, etc. 

Additionally, two nucleotide sequences are "substantially complementary" if the 
sequences have at least about 40 percent, more preferably, at least about 60 percent and 
most preferably about 90 percent sequence similarity between them. Two amino acid 
sequences are "substantially homologous" if they have at least 40%, preferably 70% 
similarity between the active portions of the polypeptides. 
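
As an illustration of how the percentage thresholds above might be applied, the following sketch computes percent identity over a pair of pre-aligned sequences and reports which threshold is met. Using identity over aligned, non-gap positions as a stand-in for "sequence similarity" is an assumption made here, and the function names and tier labels are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identical positions between two aligned sequences ('-' marks a gap)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue  # gap positions are skipped (an assumption of this sketch)
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def complementarity_tier(percent):
    """Map a percentage onto the 40 / 60 / 90 percent thresholds named in the text."""
    if percent >= 90:
        return "most preferred (about 90 percent)"
    if percent >= 60:
        return "more preferred (at least about 60 percent)"
    if percent >= 40:
        return "substantially complementary (at least about 40 percent)"
    return "below threshold"


if __name__ == "__main__":
    score = percent_identity("ACGTACGTAC", "ACGTTCGTAC")
    print(score, complementarity_tier(score))
```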

As used herein, the phrase "hybridizes to a corresponding portion" of a DNA or 
RNA molecule means that the molecule that hybridizes, e.g., oligonucleotide, 
polynucleotide, or any nucleotide sequence (in sense or antisense orientation) recognizes 
and hybridizes to a sequence in another nucleic acid molecule that is of approximately the 
same size and has enough sequence similarity thereto to effect hybridization under 
appropriate conditions. It is to be understood that the size of the "corresponding portion" 
will allow for some mismatches in hybridization such that the "corresponding portion" 
may be smaller or larger than the molecule which hybridizes to it, for example 20-30% 
larger or smaller, preferably no more than about 12-15% larger or smaller. 

The term "functional derivative" of a nucleotide sequence (or poly- or 
oligonucleotide) is used herein to mean a fragment, variant, homolog or analog of the 
nucleotide sequence of interest or of the nucleotide sequence encoding the peptide of 
interest. A functional derivative may include alternative codons for amino acids, or may 
code for different amino acids which do not substantially change the function of interest of 
the peptide encoded by the nucleotide. A functional derivative may retain at least a 
portion of the function of the nucleotide sequence of interest or of the nucleotide sequence 
encoding the peptide of interest, which function permits its utility in accordance with the 
invention. Such function may include the ability to hybridize with at least one of SEQ ID 
NOS: 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 
49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91, 93, or 
94; the ability to hybridize with a substantially homologous DNA from another organism 
which DNA encodes at least one of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 
26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 
74, 76, 78, 80, 82, 84, 86, 88, 90, 92 and 95 or a functional derivative thereof, or with an 
mRNA transcript thereof, or the ability to encode a protein that is a functional derivative of 
SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 
44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 
92 and 95, or the like. 

A "fragment" of the gene or nucleotide sequence refers to any subset of the 
molecule, e.g., a shorter polynucleotide or oligonucleotide. A "variant" refers to a 
molecule substantially similar to either the entire gene or a fragment thereof, such as a 
nucleotide substitution variant having one or more substituted nucleotides, but which 
maintains the ability to hybridize with the particular gene or to encode mRNA transcript 
which hybridizes with the native DNA. A "homolog" refers to a fragment or variant 
sequence from a different genus or species. An "analog" refers to a non-natural molecule 
substantially similar to or functioning in relation to either the entire molecule, a variant or 
a fragment thereof. 

"Functional derivatives" of the proteins as described herein are fragments, variants, 
analogs, or chemical derivatives of at least one of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 
18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 
66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92 and 95, and which retain at least a 
portion of the activity of at least one of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 
24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 
72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92 and 95 or retain immunological cross reactivity 
with an antibody specific for at least one of SEQ ID NOS: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 
22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 
70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92 and 95. As used herein, a fragment of the 
protein refers to any subset of the molecule. Variant peptides may be made by direct 
chemical synthesis, for example, using methods well known in the art. An analog of a 
protein refers to a non-natural protein substantially similar to either the entire protein or a 
fragment thereof. As used herein, a chemical derivative of a protein may contain additional 
chemical moieties not normally a part of the peptide or peptide fragment. Modifications 
may be introduced into a peptide or fragment thereof by reacting targeted amino acid 
residues of the peptide with an organic derivatizing agent that is capable of reacting with 
selected side chains or terminal residues. 

A protein or peptide according to the invention may be produced by culturing a cell 
transformed with a nucleotide sequence of this invention (in the sense orientation), 
allowing the cell to synthesize the protein and then isolating the protein, either as a free 
protein or as a fusion protein, depending on the cloning protocol used, from either the 
culture medium or from cell extracts. Alternatively, the protein can be produced in a 
cell-free system. Rami, et al., Meth. Enzymol., 60:459-484, (1979). 

As can be appreciated from the disclosure above, the present invention has a wide 
variety of applications. Accordingly, the following examples are offered by way of 
illustration, not by way of limitation. 

EXAMPLES 

Example 1 

To rapidly elucidate the nucleotide sequence, thermocycle sequencing was 
accomplished from pUC- or pBluescript-based subclones (using M13 primers and primer 
walking) as well as directly from isolated cosmids (via primer walking). Nucleotide 
sequence data was acquired using two Applied Biosystems automated 310 genetic 
analyzers and sequences were subsequently assembled using the Applied Biosystems 
AutoAssembler™ DNA sequence assembly software. Dear, S., et al., Nucl. Acids Res., 14, 
3907-3911 (1991); Huang, X., Genomics, 14, 18-25 (1992). Orf assignments were 
accomplished using a combination of the computational programs MacVector™ 6.0 and 
Brujene. MacVector is a commercially available software package which provides the 
ability to construct a Micromonospora codon bias table (from known Micromonospora 
sequences) and subsequently use this codon bias table to search for optimal orfs. Fickett, 
J.W., Nucleic Acids Research 10, 5303-5318 (1982). Alternatively, the shareware 
program Brujene was specifically designed for streptomycetes and assigns priority to orfs 
that illustrate a consistently high G/C% in the wobble position. 
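
The wobble-position criterion described above can be sketched in a few lines. This is a minimal illustration that assumes in-frame candidate orfs supplied as plain strings; it is not the algorithm actually implemented by Brujene or MacVector, and the function names are hypothetical.

```python
def wobble_gc_percent(orf):
    """Percent G or C at the third (wobble) position of each codon of an in-frame orf."""
    seq = orf.upper()
    if len(seq) < 3 or len(seq) % 3 != 0:
        raise ValueError("orf length must be a positive multiple of three")
    third_positions = seq[2::3]  # every third base, starting at codon position 3
    gc_count = sum(1 for base in third_positions if base in "GC")
    return 100.0 * gc_count / len(third_positions)


def rank_orfs_by_wobble_gc(orfs):
    """Order candidate orfs from highest to lowest wobble-position G/C%."""
    return sorted(orfs, key=wobble_gc_percent, reverse=True)


if __name__ == "__main__":
    candidates = ["ATGGCAGTT", "ATGGCCGTC"]  # hypothetical short reading frames
    for orf in rank_orfs_by_wobble_gc(candidates):
        print(orf, round(wobble_gc_percent(orf), 1))
```

In a high-G/C organism such as Micromonospora, a true coding frame would be expected to score well above the two alternative frames of the same strand, which is why the measure is useful for prioritizing orfs.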

Example 2: Isolating and Characterizing calC 

To isolate the gene(s) responsible for calicheamicin resistance in Micromonospora, 
clones conferring calicheamicin resistance were selected by growth of a Micromonospora 
genomic bifunctional cosmid library on LB plates containing ampicillin (50 µg ml⁻¹) and 
calicheamicin (0.25 µg ml⁻¹). In this selection, six clones (3a, 4a, 4b, 10a, 13a and 16a) 
displayed resistance to calicheamicin. Restriction mapping of these clones localized the 
desired phenotype to a ~2 kb PstI-SacI fragment of DNA. (Figure 2). Maximum tolerated 
concentrations of calicheamicin on the LB plates were ascertained. The results are as 
follows: 



Cosmid or Plasmid                              Maximum tolerated concentration of calicheamicin 
---------------------------------------------  ------------------------------------------------ 
cosmids 3a, 4a, 10a, 13a, and 16a              0.5 µg ml⁻¹ 
pJT1214 and pJT1232                            5.0 µg ml⁻¹ 
pRE7                                           20.0 µg ml⁻¹ 
induced pRE7                                   50.0 µg ml⁻¹ 
pJT1224, pAP6, Prel, and control plasmids      <0.01 µg ml⁻¹ 
pUC18, pBluescript, and pMAL-C2 

Nucleotide sequence analysis of the PstI-SacI fragment suggested that it contained 
two possible orfs. The proximal 1 kb of this fragment carried the single orf calD while the 
distal 1 kb presented orf calC. Computer translation of calC and subsequent BLAST 
analysis revealed no homology with known proteins, while the translation of calD to its 
respective protein, CalD, revealed the presence of three amino acid motifs typically 
conserved in S-adenosylmethionine-utilizing O-methyltransferases. Therefore, it was 
hypothesized that calD was not responsible for calicheamicin resistance. To rule out calD 
as being responsible for calicheamicin resistance, a subclone was engineered (pJT1224) to 
contain an intact calD, but a truncated calC gene. This subclone was not able to confer 
resistance to calicheamicin. Next, a subclone containing the calC region was constructed 
(pJT1232). This clone conferred calicheamicin resistance, as indicated in the above chart. 

To ascertain the amino acid sequence of CalC and learn its properties, calC was 
cloned into a pMAL-C2 vector. (pMAL-C2 by itself could not confer calicheamicin 
resistance. See above chart.) The resulting plasmid, pRE7, which contained calC, 
conferred resistance to calicheamicin. See above chart. Plasmid pRE7 was then induced 
with isopropyl β-D-thiogalactoside ("IPTG") to overexpress CalC. Induced pRE7 
conferred resistance to calicheamicin and produced a maltose-binding protein CalC fusion 
protein (mbp-CalC). This resulting overexpression of CalC increased calicheamicin 
resistance 10²-fold in vivo. See above chart. 
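
The fold increase can be checked against the chart with simple arithmetic. The sketch below assumes that the 10²-fold figure is taken relative to the original cosmid clones, which is one reading consistent with the tabulated values; the dictionary keys and function name are illustrative.

```python
# Maximum tolerated calicheamicin concentrations (ug/ml) transcribed from the chart.
MAX_TOLERATED_UG_PER_ML = {
    "cosmids": 0.5,
    "pJT1214/pJT1232": 5.0,
    "pRE7": 20.0,
    "induced pRE7": 50.0,
}


def fold_increase(construct, baseline="cosmids"):
    """Ratio of tolerated concentrations for a construct relative to a baseline."""
    return MAX_TOLERATED_UG_PER_ML[construct] / MAX_TOLERATED_UG_PER_ML[baseline]


if __name__ == "__main__":
    for name in ("pJT1214/pJT1232", "pRE7", "induced pRE7"):
        print(f"{name}: {fold_increase(name):.0f}-fold over the cosmid clones")
```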

Example 3: Expression of protein CalC 

The protein mbp-CalC was overexpressed and purified for further analysis. The 
mbp-CalC was purified from pRE7/E. coli to homogeneity as judged by SDS-PAGE. An 
overnight LB culture (containing 50 µg ml⁻¹ ampicillin and 50 ng ml⁻¹ calicheamicin) from 
a fresh pRE7/E. coli colony was grown at 37 °C, 250 rpm to an A600 = 0.5, induced with 0.5 
mM IPTG and growth continued overnight. Cells were harvested (4,000 x g, 4 °C, 20 
minutes), resuspended in buffer A (50 mM Tris-Cl, pH 7.5, 200 mM NaCl, 1 mM EDTA) 
and disrupted by sonication. The cell debris was removed by centrifugation (5,000 x g, 
4 °C, 20 minutes). The supernatant was applied to an amylose affinity column (1.5 x 7.0 
cm, 1 mL min⁻¹). The desired mbp-CalC protein was eluted with buffer A containing 10 
mM maltose. The eluate was concentrated and chromatographed on an S-300 column 
(50 mM Tris-Cl, pH 7.5, 200 mM NaCl). Active fractions were used immediately or 
frozen at -80 °C for storage. 

Example 4: Verification of CalC's calicheamicin resistance 

Given that calicheamicin leads to double strand DNA cleavage and CalC provides 
calicheamicin-resistance in vivo, it was expected that the addition of CalC to an in vitro 
calicheamicin-induced DNA cleavage assay would inhibit DNA cleavage. To test this 
theory, preliminary assays were performed with supercoiled pBluescript plasmid DNA 
("pBS") as the template, and dithiothreitol ("DTT") as the reductive initiator. In a typical 
assay, purified mbp-CalC (15.0 nM) and 30.0 nM calicheamicin were preincubated for 15 
min in a total volume of 25 µL of 40 mM Tris-Cl, pH 7.5, at 37 °C. Then 2.5 µL of 10 mM 
DTT stock solution was added to the assay solution, and the assay was incubated an 
additional 1 hour at 37 °C. DNA fragmentation was assessed by electrophoresis on a 1% 
agarose gel stained with ethidium bromide. Using this assay, it was found that mbp-CalC 
could completely inhibit calicheamicin-induced DNA cleavage at concentrations nearing 
10³-fold excess of calicheamicin. Preincubation of mbp-CalC and DTT, protein removal 
via forced dialysis, and the subsequent use of the DTT solution as reductant did not 
noticeably affect the amount of DNA cleavage. 
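
The working DTT concentration in the assay described above follows from a simple dilution. The sketch assumes that the 25 µL preincubation volume and the 2.5 µL DTT addition are additive and that the stated concentrations refer to the preincubation mixture; the function name is illustrative.

```python
def final_concentration(stock_conc, stock_volume, initial_volume):
    """Concentration after adding stock_volume of stock to initial_volume (same volume units)."""
    total_volume = stock_volume + initial_volume
    if total_volume <= 0:
        raise ValueError("total volume must be positive")
    return stock_conc * stock_volume / total_volume


if __name__ == "__main__":
    # 2.5 uL of 10 mM DTT added to a 25 uL preincubation mixture.
    dtt_mm = final_concentration(stock_conc=10.0, stock_volume=2.5, initial_volume=25.0)
    print(f"final DTT: {dtt_mm:.3f} mM")
    # Molar ratio of mbp-CalC to calicheamicin in the typical assay (15.0 nM : 30.0 nM).
    print(f"mbp-CalC : calicheamicin = {15.0 / 30.0:.1f}")
```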

As indicated in Figure 4(b), no DNA cleavage was observed in the absence of DTT 
or calicheamicin (lanes a and b), while efficient cleavage was demonstrated in the presence 
of DTT and calicheamicin (lane c). As expected, the addition of mbp-CalC completely 
inhibited calicheamicin-induced DNA cleavage (lane f) while the addition of mbp alone 
(lane d) as a control, failed to inhibit calicheamicin-induced DNA cleavage. Furthermore, 
preincubation of mbp-CalC with DTT (not shown), or apo-mbp-CalC (lacking the Fe 
cofactor) (lane e), also failed to inhibit calicheamicin-induced DNA cleavage. However, 
the addition of Fe²⁺ or Fe³⁺ to the apo-mbp-CalC assay could reconstitute CalC activity 
(lane g). Reconstitution of apo-mbp-CalC was accomplished by preincubation with 1 mM 
FeSO4 (Fe²⁺) or FeCl3 (Fe³⁺) prior to the activity assay as previously described. 

Example 5: Production of methymycin/pikromycin-calicheamicin hybrid compounds 

The 1.2 kb calH gene was amplified by polymerase chain reaction (PCR) from 
pJST1192kpn7, which is a subclone containing a 7.0 kb KpnI fragment of cosmid 13a. The 
amplified gene was cloned into the EcoRI/XbaI site of the expression vector pDHS617. 
This expression vector contains an apramycin resistance marker. The plasmid pDHS617 
was derived from pOJ1446 (Bierman, M. et al., Gene 1992, 116, 43-49). A promoter 
sequence from the S. venezuelae methymycin/pikromycin cluster was incorporated in the 
plasmid to drive the expression of foreign genes in S. venezuelae. The resulting plasmid, 
pLZ-C242 (containing the calH gene insert and the promoter sequence), was introduced by 
conjugal transfer using E. coli S17-1 into a previously constructed S. venezuelae mutant, 
DesI. (Borisova, S. et al., Org. Lett. 1999, 1, 133-136). In the DesI mutant, the desI gene was 
replaced by the neomycin resistance gene, which confers resistance to kanamycin. The 
pLZ-C242-containing S. venezuelae-DesI colonies were identified on the basis of their 
resistance to the antibiotic apramycin. One of these positive colonies, DesI/calH-1, was grown 
in 100 ml of seed medium at 29 °C for 48 hours and then inoculated and grown in five 
liters of vegetative medium. Cane, D.E., et al., J. Am. Chem. Soc., 1993, 115, 522-526. 
The culture was centrifuged to remove cellular debris and mycelia. The supernatant was 
adjusted to pH 9.5 with concentrated KOH, followed by chloroform extraction. The crude 
products (700 mg) were subjected to flash chromatography on silica gel using a gradient of 
1-20% methanol in chloroform. A major product, 10-deoxymethynolide (ca. 400 mg), and 
a mixture of two minor macrolide compounds were obtained. The two macrolides were 
further purified by HPLC on a C18 column using an isocratic mobile phase of 
acetonitrile/H2O (1:1). They were later identified as compound (11) and compound 
(12) (figure 7) by spectral analyses. 

Example 6: Molecular Break Light Assay 

The invention further provides for a method of assaying the calicheamicin-induced 
DNA cleavage and its CalC mediated inhibition using the molecular break light assay. 
Two molecular break lights for the experiments are shown in Fig. 13. Break light A was 
comprised of a 10-base pair stem which contained the known calicheamicin recognition 
sequence 5'-TCCT-3', while break light B carried the BamHI endonuclease recognition 
sequence 5'-GGATCC-3'. The length of break light B also considered the requirement of 
a 3 base pair overhang required for BamHI recognition and the stem of break light A was 
adjusted to a comparable length and melting temperature. The loop of both probes 
consisted of a T4 loop to ensure non-hybridizing interactions. The 5'-fluorophore of both 
probes was fluorescein (FAM, absorbance max = 485 nm, emission max = 517 nm) while the 
corresponding 3'-quencher was 4-(4'-dimethylaminophenylazo)benzoic acid (DABCYL). 
Previous studies have shown DABCYL to serve as a universal quencher in molecular 
beacons and there is significant spectral overlap (1.02 x 10⁻¹⁵ M⁻¹ cm³) between the 
emission spectrum of FAM and the absorption spectrum of DABCYL. In a typical 
molecular beacon, the quenching efficiency of this pair via FRET has been shown to be 
essentially complete (99.9%), providing a significant enhancement of the signal to noise 
ratio as compared to typical complementary oligonucleotide pair FRET-based assays. 
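
The practical meaning of a 99.9% quenching efficiency can be shown with a short calculation. The sketch assumes an idealized probe in which the only background is residual fluorescence of the intact, quenched break light and in which cleavage restores the full fluorophore signal; the function names are illustrative.

```python
def residual_fluorescence_fraction(quench_efficiency):
    """Fraction of fluorophore signal still emitted by the intact, quenched probe."""
    if not 0.0 <= quench_efficiency <= 1.0:
        raise ValueError("quench efficiency must be between 0 and 1")
    return 1.0 - quench_efficiency


def max_signal_to_background(quench_efficiency):
    """Idealized ratio of fully cleaved signal to intact-probe background."""
    residual = residual_fluorescence_fraction(quench_efficiency)
    if residual == 0.0:
        raise ValueError("perfect quenching gives an unbounded ratio")
    return 1.0 / residual


if __name__ == "__main__":
    print(f"about {max_signal_to_background(0.999):.0f}-fold signal over background")
```

Under these assumptions a quenching efficiency of 99.9% corresponds to roughly a thousand-fold ceiling on the signal gained upon cleavage; real measurements would be lower because of instrument background and incomplete cleavage.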

Enzymatic Cleavage as Proof of Principle. The first test was to demonstrate the 
specificity of the designed molecular break lights via enzymatic cleavage. Specifically, 
only break light B should cleave in the presence of the restriction endonuclease BamHI 
while both A and B should be digested by the non-specific nuclease DNaseI. As 
anticipated, Fig. 14a reveals a time-dependent and [BamHI]-dependent increase of 
fluorescence only with B while A shows no change at 37 °C. Fig. 14b illustrates an 
increase of fluorescence over time with either break light A or B when digested with 
DNaseI which is also [DNaseI]-dependent. In comparison, control samples containing 
break lights alone or break lights in the presence of BSA gave no change in fluorescence 
over > 2 hr at 37 °C. Given the lack of fluorescence in the absence of enzyme, the 
designed break lights show no appreciable melting at the designated assay temperature. 
Furthermore, these experiments clearly demonstrate the specificity of cleavage by BamHI 
for B and, for the first time, illustrate the principle application of molecular break lights to 
assess DNA cleavage. 

Interestingly, the fluorescence maximum intensity obtained upon complete BamHI 
cleavage was only 75% that observed in the presence of DNaseI at the same concentration 
of molecular break light. Furthermore, after the BamHI reaction was complete, the 
addition of BamHI showed no change while the addition of DNaseI resulted in additional 
cleavage to give the expected 100% fluorescence maximum. This observation suggests the 
poly-guanidine tail left attached to FAM upon BamHI digestion quenches the fluorescent 
signal by ~25%. Consistent with this finding, PAGE analysis of the reaction products 
confirmed the presence of a 3-base overhang after excess treatment with BamHI which is 
completely degraded upon DNaseI digestion. As a result, the fluorescence maximum 
observed with excess BamHI was designated 100% cleavage for the BamHI kinetic studies 
described below. 
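
The normalization described above, in which the BamHI plateau rather than the DNaseI plateau defines 100% cleavage, can be sketched as follows. The fluorescence values in the usage example are hypothetical and the function name is illustrative.

```python
def percent_cleavage(signal, baseline, plateau):
    """Express a fluorescence reading as percent cleavage relative to a chosen plateau.

    baseline: fluorescence of the intact break light
    plateau:  fluorescence designated as 100% cleavage (here the excess-BamHI maximum)
    """
    if plateau <= baseline:
        raise ValueError("plateau must exceed baseline")
    fraction = (signal - baseline) / (plateau - baseline)
    # Clamp to the physically meaningful range of 0-100%.
    return 100.0 * min(max(fraction, 0.0), 1.0)


if __name__ == "__main__":
    dnase_max = 1000.0             # hypothetical DNaseI plateau (arbitrary units)
    bamhi_max = 0.75 * dnase_max   # BamHI plateau is 75% of the DNaseI plateau
    print(percent_cleavage(signal=375.0, baseline=0.0, plateau=bamhi_max))
```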

Enediyne-Catalyzed Cleavage. Previous assays for enediyne cleavage of DNA relied 
upon discontinuous assays using radioactive DNA probes, electrophoresis and subsequent 
phosphoimager analysis. In contrast, by using break lights one can directly follow the 
extent of DNA cleavage by a specific enediyne in real time with high sensitivity. To 
demonstrate, Fig. 15a,b and Fig. 16a,c,d illustrate cleavage of break light A with varying 
concentrations of naturally occurring enediynes (including calicheamicin (1) and 
esperamicin (2)), non-enediyne small molecule agents (such as bleomycin (3), 
methidiumpropyl-Fe-EDTA (4), and Fe-EDTA (5)), and the restriction endonuclease BamHI, 
in the presence of excess reductive activator DTT. Under the conditions described, this assay allows the 
detection of 1 in the pM range. This sensitivity compares to that of the biochemical 
induction assay (BIA), the method of choice in detecting DNA-damaging agents. 
Furthermore, the sensitivity can be significantly enhanced by simply increasing the 
concentration of the molecular break light in the assay as demonstrated with the iron- 
dependent agents. The observed maximum fluorescence obtained upon cleavage of 3.2 
nM break light A with either 1 or 2 was identical to that observed with DNaseI, consistent 
with complete degradation of the oligonucleotide. As controls, incubation of molecular 
break light A with either DTT or enediyne alone revealed no change in fluorescence. 
Furthermore, although there is some debate regarding the "specificity" of 1, molecular 
break light B was cleaved by 1 at an identical rate. This supports the view that the 
specificity of 1 is more dependent upon context and perhaps less so on DNA sequence. It 
should also be noted that 1 leads to predominantly double-stranded cleavage while 2 
provides single-stranded nicks and the current molecular break light assay cannot 
distinguish these two phenomena. 

Interestingly, two distinct rates were observed in the enediyne molecular break 
light assay. The first (0-50 seconds) is a lag time most likely attributed to the enediyne 
activation while the second (50-200 seconds) is indicative of the initial velocity of DNA 
cleavage. To confirm this, assays were also established in which DTT and enediyne were 
first preincubated for 1-5 min followed by initiation via the addition of the substrate 
oligonucleotide. In these preincubation experiments, the previously observed "lag time" 
attributed to activation was no longer evident while the initial velocity of DNA cleavage 
was identical to that determined in the standard assay. Preincubation for longer periods (> 
30 min) revealed the same phenomenon, suggesting "activated" enediynes are perhaps 
more stable in an aqueous aerobic environment than previously estimated. 
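
The separation of the lag phase from the initial velocity can be illustrated with a least-squares fit restricted to the 50-200 second window. This is a minimal sketch on synthetic data; the window boundaries come from the text, while the function name, the data and the choice of an ordinary linear fit are assumptions made for illustration.

```python
def initial_velocity(times_s, fluorescence, window=(50.0, 200.0)):
    """Least-squares slope of fluorescence versus time within the given window."""
    points = [(t, f) for t, f in zip(times_s, fluorescence) if window[0] <= t <= window[1]]
    if len(points) < 2:
        raise ValueError("need at least two points inside the window")
    n = len(points)
    mean_t = sum(t for t, _ in points) / n
    mean_f = sum(f for _, f in points) / n
    sxx = sum((t - mean_t) ** 2 for t, _ in points)
    if sxx == 0:
        raise ValueError("time points inside the window must differ")
    sxy = sum((t - mean_t) * (f - mean_f) for t, f in points)
    return sxy / sxx  # fluorescence units per second


if __name__ == "__main__":
    # Synthetic trace: flat during a 50 s lag, then rising at 2 units per second.
    times = list(range(0, 201, 10))
    trace = [0.0 if t < 50 else 2.0 * (t - 50) for t in times]
    print(f"initial velocity: {initial_velocity(times, trace):.2f} units/s")
```

Excluding the first 50 seconds keeps the activation lag from depressing the fitted slope, which matches the observation that preincubation removes the lag without changing the velocity.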

CalC inhibits calicheamicin mediated DNA cleavage. As illustrated in figure 17, 
CalC directly inhibits calicheamicin-mediated DNA cleavage in the break light assay. 
Break light A (3.6 pM) is coincubated with 3.5 nM calicheamicin and increasing amounts of 
CalC (0.0 nM, 1.3 nM, 2.6 nM, 3.9 nM, 5.2 nM). Complete inhibition of calicheamicin is 
achieved with roughly 2-fold excess of CalC. CalC has no effect on esperamicin-induced 
cleavage of DNA (data not shown). 
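
The titration above can be tabulated as molar ratios of CalC to calicheamicin. The sketch simply divides the stated concentrations; the function name is illustrative, and no inhibition values are implied beyond what the text reports.

```python
CALICHEAMICIN_NM = 3.5                     # calicheamicin concentration from the text
CALC_SERIES_NM = [0.0, 1.3, 2.6, 3.9, 5.2]  # CalC titration points from the text


def molar_ratios(protein_nm_series, ligand_nm):
    """Molar ratio of protein to ligand at each titration point."""
    if ligand_nm <= 0:
        raise ValueError("ligand concentration must be positive")
    return [protein / ligand_nm for protein in protein_nm_series]


if __name__ == "__main__":
    for calc_nm, ratio in zip(CALC_SERIES_NM, molar_ratios(CALC_SERIES_NM, CALICHEAMICIN_NM)):
        print(f"{calc_nm:.1f} nM CalC -> {ratio:.2f} equivalents")
```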

All publications, patents and patent applications referred to herein are incorporated 
in this application by reference in their entirety to the same extent as if each individual 
publication, patent or patent application was specifically and individually indicated to be 
incorporated by reference in its entirety. 

CLAIMS 

1. An isolated nucleic acid molecule from a nonchromoprotein enediyne biosynthetic 
gene cluster from Micromonospora echinospora comprising said nucleic acid 
molecule, a portion or portions of said nucleic acid molecule wherein said portion 
or portions encode a protein or proteins, a portion or portions of said nucleic acid 
molecule wherein said portion or portions encode a biologically active fragment of 
a protein or proteins, a single-stranded nucleic acid molecule derived from said 
nucleic acid molecule, or a single-stranded nucleic acid molecule derived from a 
portion or portions of said nucleic acid molecule. 

2. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule comprises at least one of calA, calB, calC, calD, calE, calF, calG, calH, 
calI, calJ, calK, calL, calM, calN, calO, calP, calQ, calR, calS, calT, calU, calV, 
calW, calX, 6MSAS, ActI, ActII, ActIII, orf1, orf2, orf3, orf4, orf5, orf6, orf7, orf8, 
orfI, orfII, orfIII, orfIV, orfV, orfVI, orfVII, orfVIII, orfIX, orfX, orfXI, or an 
IS-element gene. 

3. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes two or more proteins. 

4. The isolated nucleic acid molecule of Claim 1, wherein said molecule comprises a 
complete nonchromoprotein enediyne biosynthetic gene cluster. 

5. The isolated nucleic acid molecule of Claim 1, wherein said nonchromoprotein 
enediyne is calicheamicin. 

6. An isolated nucleic acid molecule capable of hybridizing with a nucleic acid 
molecule from Micromonospora echinospora spp. calichensis, wherein said 
nucleic acid molecule from Micromonospora echinospora spp. calichensis encodes 
one or more proteins from a nonchromoprotein enediyne biosynthetic gene cluster. 

7. The isolated nucleic acid molecule of Claim 6, wherein said molecule encodes a 
protein having the activity of at least one protein from said nonchromoprotein 
enediyne biosynthetic gene cluster. 

8. The isolated nucleic acid molecule of Claim 6, wherein said nucleic acid molecule 
comprises at least one of calA, calB, calC, calD, calE, calF, calG, calH, calI, calJ, 
calK, calL, calM, calN, calO, calP, calQ, calR, calS, calT, calU, calV, calW, 
calX, 6MSAS, ActI, ActII, ActIII, orf1, orf2, orf3, orf4, orf5, orf6, orf7, orf8, orfI, 
orfII, orfIII, orfIV, orfV, orfVI, orfVII, orfVIII, orfIX, orfX, orfXI, or an IS-element 
gene. 

9. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 1. 

10. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 3. 

11. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 5. 

12. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 7. 

13. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 9. 

14. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 11. 

15. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 13. 

16. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 15. 

17. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 17. 

18. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 19. 

19. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 21. 

20. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 23. 

21. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 25. 

22. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 27. 

23. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 29. 

24. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 31. 

25. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 33. 

26. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 35. 

27. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 37. 

28. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 39. 

29. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 41. 

30. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 43. 

31. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 45. 

32. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 47. 

33. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 49. 

34. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 51. 

35. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 53. 

36. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 55. 

37. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 57. 

38. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 59. 

39. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 61. 

40. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 63. 

41. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 65. 

42. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 67. 

43. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 69. 

44. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 71. 

45. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 73. 

46. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 75. 

47. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 77. 

48. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 79. 

49. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 81. 

50. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 83. 

51. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 85. 

52. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 87. 

53. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 89. 

54. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 91. 

55. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 93. 

56. The isolated nucleic acid molecule of Claim 1, wherein said nucleic acid molecule 
comprises SEQ ID No. 94. 

57. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a P450 oxidase from Micromonospora echinospora spp. 
calichensis. 

58. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a membrane transporter from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

59. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes an O-methyltransferase from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

60. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a glycosyltransferase from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

61. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes an N,N-dimethyltransferase from a gene cluster of 
Micromonospora echinospora spp. calichensis coding for calicheamicin 
biosynthesis. 

62. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a dipeptide transporter from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

63. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes an L-cysteine/cystine C-S-lyase from a gene cluster of 
Micromonospora echinospora spp. calichensis coding for calicheamicin 
biosynthesis. 

64. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes an oligopeptide transporter protein from a gene cluster of 
Micromonospora echinospora spp. calichensis coding for calicheamicin 
biosynthesis. 

65. The isolated nucleic acid molecule of Claim 1, which encodes a polypeptide 
encoding for a regulatory protein from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

66. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a hexopyranosyl-2,3-reductase from Micromonospora 
echinospora spp. calichensis. 

67. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a desaturase from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

68. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a UDP-D-glucose 6-dehydrogenase from Micromonospora 
echinospora spp. calichensis. 

69. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a transcriptional regulator from a gene cluster of 
Micromonospora echinospora spp. calichensis coding for calicheamicin 
biosynthesis. 

70. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes an oxygenase from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

71. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a halogenase from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

72. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a β-keto-acyl synthase III from a gene cluster of 
Micromonospora echinospora spp. calichensis coding for calicheamicin 
biosynthesis. 

73. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a cytochrome P450 from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

74. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a TDP-4-keto-6-deoxy-L-hexose 2,3-dehydrogenase from a gene 
cluster of Micromonospora echinospora spp. calichensis coding for calicheamicin 
biosynthesis. 

75. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes an orsellinic acid synthase from a gene cluster of 
Micromonospora echinospora spp. calichensis coding for calicheamicin 
biosynthesis. 

76. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a polyketide cyclase from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

77. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a polyketide synthase from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

78. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes an integrase from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

79. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a chromosome partitioning protein from a gene cluster of 
Micromonospora echinospora spp. calichensis coding for calicheamicin 
biosynthesis. 

80. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a hydroxylase from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

81. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes an aminotransferase from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

82. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a glu-ammonia-ligase adenylyltransferase from a gene cluster 
of Micromonospora echinospora spp. calichensis coding for calicheamicin 
biosynthesis. 

83. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a methyltransferase from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

84. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes an integral membrane protein from a gene cluster of 
Micromonospora echinospora spp. calichensis coding for calicheamicin 
biosynthesis. 

85. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes a membrane protein from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

86. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes an immunity resistance protein from a gene cluster of 
Micromonospora echinospora spp. calichensis coding for calicheamicin 
biosynthesis. 

87. The isolated nucleic acid molecule of Claim 1, wherein said isolated nucleic acid 
molecule encodes an insertional element from a gene cluster of Micromonospora 
echinospora spp. calichensis coding for calicheamicin biosynthesis. 

88. An expression vector comprising a nucleic acid molecule encoding a protein or 
biologically active fragment of a protein, wherein said nucleic acid molecule is a 
nucleic acid molecule of Claim 1. 

89. The expression vector of Claim 88, wherein said nucleic acid molecule is operably 
linked to regulatory sequences to control expression of said protein or polypeptide. 

90. The expression vector of Claim 89, wherein the regulatory sequence is a 
Streptomyces promoter. 

91. A host cell transformed with the nucleic acid molecule of Claim 1. 

92. A host cell transformed with the expression vector of Claim 88. 

93. A host cell transformed with the expression vector of Claim 89. 

94. The host cell of Claim 91, wherein said host cell is a bacterium, yeast, insect, plant, 
fungal, or mammalian cell. 

95. The host cell of Claim 91, wherein the host bacterium is E. coli or Streptomyces. 

96. A cosmid comprising an isolated nucleic acid molecule from a nonchromoprotein 
enediyne biosynthetic gene cluster from Micromonospora echinospora, wherein 
said isolated nucleic acid molecule comprises said nucleic acid molecule, a portion 
or portions of said nucleic acid molecule wherein said portion or portions encode a 
protein or proteins, a portion or portions of said nucleic acid molecule wherein 
said portion or portions encode a biologically active fragment of a protein or 
proteins, a single-stranded nucleic acid molecule derived from said nucleic acid 
molecule, or a single-stranded nucleic acid molecule derived from a portion or 
portions of said nucleic acid molecule. 

97. The cosmid of Claim 96, wherein said nucleic acid molecule comprises at least one 
of calA, calB, calC, calD, calE, calF, calG, calH, calI, calJ, calK, calL, calM, 
calN, calO, calP, calQ, calR, calS, calT, calU, calV, calW, calX, 6MSAS, ActI, 
ActII, ActIII, orf1, orf2, orf3, orf4, orf5, orf6, orf7, orf8, orfI, orfII, orfIII, orfIV, 
orfV, orfVI, orfVII, orfVIII, orfIX, orfX, orfXI, or an IS-element gene. 

98. A method of expressing a protein comprising the steps of transfecting a host cell 
with the expression vector of Claim 88 and incubating said cell for a length of time 
and under conditions sufficient for expression of a desired quantity of said protein 
or said biologically active fragment of a protein. 

99. The method of Claim 97, wherein said host cell is a bacterium, yeast, insect, plant, 
fungal, or mammalian cell. 

100. A method of purifying calicheamicin using affinity chromatography, comprising 
the steps of exposing a solution containing calicheamicin to an affinity column 
having CalC bound thereto, and recovering calicheamicin. 

101. A polypeptide comprising amino acid sequence SEQ ID. No.: 2. 

102. A polypeptide comprising amino acid sequence SEQ ID. No.: 4. 

103. A polypeptide comprising amino acid sequence SEQ ID. No.: 6. 

104. A polypeptide comprising amino acid sequence SEQ ID. No.: 8. 

105. A polypeptide comprising amino acid sequence SEQ ID. No.: 10. 

106. A polypeptide comprising amino acid sequence SEQ ID. No.: 12. 

107. A polypeptide comprising amino acid sequence SEQ ID. No.: 14. 

108. A polypeptide comprising amino acid sequence SEQ ID. No.: 16. 

109. A polypeptide comprising amino acid sequence SEQ ID. No.: 18. 

110. A polypeptide comprising amino acid sequence SEQ ID. No.: 20. 

111. A polypeptide comprising amino acid sequence SEQ ID. No.: 22. 

112. A polypeptide comprising amino acid sequence SEQ ID. No.: 24. 

113. A polypeptide comprising amino acid sequence SEQ ID. No.: 26. 

114. A polypeptide comprising amino acid sequence SEQ ID. No.: 28. 

115. A polypeptide comprising amino acid sequence SEQ ID. No.: 30. 

116. A polypeptide comprising amino acid sequence SEQ ID. No.: 32. 

117. A polypeptide comprising amino acid sequence SEQ ID. No.: 34. 

118. A polypeptide comprising amino acid sequence SEQ ID. No.: 36. 

119. A polypeptide comprising amino acid sequence SEQ ID. No.: 38. 

120. A polypeptide comprising amino acid sequence SEQ ID. No.: 40. 

121. A polypeptide comprising amino acid sequence SEQ ID. No.: 42. 

122. A polypeptide comprising amino acid sequence SEQ ID. No.: 44. 

123. A polypeptide comprising amino acid sequence SEQ ID. No.: 46. 

124. A polypeptide comprising amino acid sequence SEQ ID. No.: 48. 

125. A polypeptide comprising amino acid sequence SEQ ID. No.: 50. 

126. A polypeptide comprising amino acid sequence SEQ ID. No.: 52. 

127. A polypeptide comprising amino acid sequence SEQ ID. No.: 54. 

128. A polypeptide comprising amino acid sequence SEQ ID. No.: 58. 

129. A polypeptide comprising amino acid sequence SEQ ID. No.: 60. 

130. A polypeptide comprising amino acid sequence SEQ ID. No.: 62. 

131. A polypeptide comprising amino acid sequence SEQ ID. No.: 64. 

132. A polypeptide comprising amino acid sequence SEQ ID. No.: 66. 

133. A polypeptide comprising amino acid sequence SEQ ID. No.: 68. 

134. A polypeptide comprising amino acid sequence SEQ ID. No.: 80. 

135. A polypeptide comprising amino acid sequence SEQ ID. No.: 82. 

136. A polypeptide comprising amino acid sequence SEQ ID. No.: 84. 

137. A polypeptide comprising amino acid sequence SEQ ID. No.: 86. 

138. A polypeptide comprising amino acid sequence SEQ ID. No.: 88. 

139. A polypeptide comprising amino acid sequence SEQ ID. No.: 90. 

140. A polypeptide comprising amino acid sequence SEQ ID. No.: 92. 

141. A polypeptide comprising amino acid sequence SEQ ID. No.: 95. 

142. A method of conferring calicheamicin resistance on a subject comprising the steps 
of obtaining cells from the subject, transforming the cells with a calicheamicin self 
resistance gene, and returning the cells to the subject. 

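143. A compound having the structure: 

[chemical structure drawing not recoverable from the source text] 

144. A compound having the structure: 

[chemical structure drawing not recoverable from the source text; surviving labels: Me, 12] 
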
145. The isolated nucleic acid molecule of claim 1, wherein said protein comprises at 
least one of amino acid sequence SEQ ID Nos.: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 
22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 
66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 92, or 95. 

146. The isolated nucleic acid molecule of claim 1, wherein said biologically active 
fragment of a protein comprises a biologically active portion of at least one of 
amino acid sequence SEQ ID Nos.: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 
30, 32, 34, 36, 38, 40, 42, 44, 46, 48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 
74, 76, 78, 80, 82, 84, 86, 88, 90, 92, or 95. 

147. A polypeptide comprising amino acid sequence SEQ ID. No.: 56. 

148. An isolated nucleic acid molecule comprising at least one of the nucleotide 
sequences of SEQ ID Nos. 1, 3, 5, 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 
33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 
77, 79, 81, 83, 85, 87, 89, 91, 93, or 94, or a portion or portions thereof or an allele 
or alleles thereof, wherein said isolated nucleic acid molecule encodes a 
biologically functional protein or portion of a protein. 

149. A polypeptide comprising the amino acid sequence of at least one of SEQ ID Nos. 
2, 4, 6, 8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46, 
48, 50, 52, 54, 56, 58, 60, 62, 64, 66, 68, 70, 72, 74, 76, 78, 80, 82, 84, 86, 88, 90, 
92, or 95, or a functional variant of one or more of those polypeptides. 

Abstract 

An isolated gene cluster of Micromonospora echinospora which codes for 
calicheamicin biosynthesis. The biosynthetic gene cluster contains genes encoding 
proteins and enzymes used in the biosynthetic production of calicheamicin, including the 
aryltetrasaccharide and aglycone. The gene cluster also includes the gene coding for the 
protein conferring calicheamicin resistance. The invention also provides isolated genes of 
the biosynthetic cluster and their corresponding proteins. In addition, the invention relates 
to DNA hybridizing with the calicheamicin gene cluster and the isolated genes of that 
cluster. Expression vectors containing genes of the biosynthetic gene cluster and their 
functional variants are also provided. The invention also relates to host cells conjugated 
with DNA isolated from the Micromonospora echinospora spp. calichensis genome. 

Summary of cosmid clones isolated from M. echinospora genomic library. 

SEQUENCE LISTING 



<110> Jon S. THORSON 

<120> MICROMONOSPORA ECHINOSPORA GENES 
ENCODING FOR BIOSYNTHESIS OF 
CALICHEAMICIN AND SELF-RESISTANCE THERETO 



<130> 2653-26 

<140> 09/457,045 
<141> 1999-12-07 

<150> 60/111,325 
<151> 1998-12-07 

<160> 95 

<170> FastSEQ for Windows Version 4.0 
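
For readers who want to process the general-information block above by machine, the following is a minimal Python sketch, not part of the filing. It assumes the numeric identifiers follow the usual WIPO ST.25 convention (for example, <160> is the number of sequences and <141> the filing date). The function name `parse_header` and the field labels in `FIELD_NAMES` are illustrative choices made here, and the header text is copied from the listing with the title spacing repaired.

```python
import re

# Paraphrased labels for the ST.25 numeric identifiers that appear in the
# header above. The label strings are illustrative, not official wording.
FIELD_NAMES = {
    "110": "applicant",
    "120": "title",
    "130": "file_reference",
    "140": "application_number",
    "141": "filing_date",
    "150": "prior_application_number",
    "151": "prior_filing_date",
    "160": "number_of_sequences",
    "170": "software",
}

FIELD_RE = re.compile(r"^<(\d{3})>\s*(.*)$")


def parse_header(text):
    """Return a dict mapping field labels to values.

    Lines without a <NNN> tag are treated as continuations of the
    previous field (the title wraps over three lines in this listing).
    """
    fields = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        match = FIELD_RE.match(line)
        if match:
            current = FIELD_NAMES.get(match.group(1), match.group(1))
            fields[current] = match.group(2).strip()
        elif current is not None:
            fields[current] = (fields[current] + " " + line).strip()
    return fields


HEADER = """\
<110> Jon S. THORSON
<120> MICROMONOSPORA ECHINOSPORA GENES
ENCODING FOR BIOSYNTHESIS OF
CALICHEAMICIN AND SELF-RESISTANCE THERETO
<130> 2653-26
<140> 09/457,045
<141> 1999-12-07
<150> 60/111,325
<151> 1998-12-07
<160> 95
<170> FastSEQ for Windows Version 4.0
"""

if __name__ == "__main__":
    for label, value in parse_header(HEADER).items():
        print(f"{label}: {value}")
```

Unknown identifiers are kept under their numeric code rather than dropped, so the same sketch could be pointed at the per-sequence blocks (<210>, <211>, and so on) without modification.
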

<210> 1 
<211> 546 
<212> DNA 
<213> Bacteria 

<220> 
<221> CDS 
<222> (1)...(546) 

<400> 1 

atg act cag gag aag acc gca ccg gcc gcg aag agc acg acc acc aag 48 
Met Thr Gln Glu Lys Thr Ala Pro Ala Ala Lys Ser Thr Thr Thr Lys 
1 5 10 15 

agc acc gcc gcg aag aag ccg aag ccc ccg aac tac gac ccg ttc gtc 96 
Ser Thr Ala Ala Lys Lys Pro Lys Pro Pro Asn Tyr Asp Pro Phe Val 
20 25 30 

cgg cac agc gtc act gtc aag gcc gac cgc aag acc gcc ttc aag acg 144 
Arg His Ser Val Thr Val Lys Ala Asp Arg Lys Thr Ala Phe Lys Thr 
35 40 45 

ttc ctc gaa ggc ttt ccg gag tgg tgg ccg aac aac ttc cgc acc acc 192 
Phe Leu Glu Gly Phe Pro Glu Trp Trp Pro Asn Asn Phe Arg Thr Thr 
50 55 60 

aag gtc ggg gcc ccg ctg ggc gtc gac aag aag ggc ggc cgc tgg tac 240 
Lys Val Gly Ala Pro Leu Gly Val Asp Lys Lys Gly Gly Arg Trp Tyr 
65 70 75 80 

gag atc gac gag cag ggc gag gag cac acc ttc ggc ctg atc cgg aag 288 
Glu Ile Asp Glu Gln Gly Glu Glu His Thr Phe Gly Leu Ile Arg Lys 
85 90 95 

gtg gac gag ccg gac acg ctg gtc atc ggc tgg cgg ctc aac ggc ttc 336 
Val Asp Glu Pro Asp Thr Leu Val Ile Gly Trp Arg Leu Asn Gly Phe 
100 105 110 

ggc cgg atc gac ccg gac aac tcg agc gag ttc acc gtg acc ttc gtg 384 
Gly Arg Ile Asp Pro Asp Asn Ser Ser Glu Phe Thr Val Thr Phe Val 
115 120 125 

gcc gac ggc cag aag aag acc cgg gtg gac gtc gag cac acc cac ttc 432 
Ala Asp Gly Gln Lys Lys Thr Arg Val Asp Val Glu His Thr His Phe 
130 135 140 

gac cgg atg ggc acc aag cac gcc aag cgg gtc cgc aac ggc atg gac 480 
Asp Arg Met Gly Thr Lys His Ala Lys Arg Val Arg Asn Gly Met Asp 
145 150 155 160 

aag ggc tgg ccg acg atc ctc cag tcg ttc cag gac aag atc gac gag 528 
Lys Gly Trp Pro Thr Ile Leu Gln Ser Phe Gln Asp Lys Ile Asp Glu 
165 170 175 

gaa ggg gcg aag aag tga 546 
Glu Gly Ala Lys Lys * 
180 



<210> 2 
<211> 181 
<212> PRT 
<213> Bacteria 

<400> 2 

Met Thr Gln Glu Lys Thr Ala Pro Ala Ala Lys Ser Thr Thr Thr Lys 
1 5 10 15 

Ser Thr Ala Ala Lys Lys Pro Lys Pro Pro Asn Tyr Asp Pro Phe Val 
20 25 30 

Arg His Ser Val Thr Val Lys Ala Asp Arg Lys Thr Ala Phe Lys Thr 
35 40 45 

Phe Leu Glu Gly Phe Pro Glu Trp Trp Pro Asn Asn Phe Arg Thr Thr 
50 55 60 

Lys Val Gly Ala Pro Leu Gly Val Asp Lys Lys Gly Gly Arg Trp Tyr 
65 70 75 80 

Glu Ile Asp Glu Gln Gly Glu Glu His Thr Phe Gly Leu Ile Arg Lys 
85 90 95 

Val Asp Glu Pro Asp Thr Leu Val Ile Gly Trp Arg Leu Asn Gly Phe 
100 105 110 

Gly Arg Ile Asp Pro Asp Asn Ser Ser Glu Phe Thr Val Thr Phe Val 
115 120 125 

Ala Asp Gly Gln Lys Lys Thr Arg Val Asp Val Glu His Thr His Phe 
130 135 140 

Asp Arg Met Gly Thr Lys His Ala Lys Arg Val Arg Asn Gly Met Asp 
145 150 155 160 

Lys Gly Trp Pro Thr Ile Leu Gln Ser Phe Gln Asp Lys Ile Asp Glu 
165 170 175 

Glu Gly Ala Lys Lys 
180 
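
The correspondence between the coding sequence of SEQ ID No. 1 and the protein of SEQ ID No. 2 can be spot-checked with a short script. The following Python sketch is illustrative only and is not part of the filing. It assumes the standard genetic code (which assigns the same amino acids as the bacterial code for the codons used here) and uses the first and last rows of the SEQ ID No. 1 coding sequence, with codons read as implied by the paired amino acid lines. The names `CODON_TABLE` and `translate` are chosen here for illustration.

```python
# Standard genetic code, built from the conventional 64-letter string in
# t, c, a, g order for each codon position. "*" marks a stop codon.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string (whitespace ignored) into one-letter codes."""
    dna = "".join(dna.lower().split())
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First and last rows of the SEQ ID No. 1 coding sequence.
FIRST_ROW = "atg act cag gag aag acc gca ccg gcc gcg aag agc acg acc acc aag"
LAST_ROW = "gaa ggg gcg aag aag tga"

if __name__ == "__main__":
    print(translate(FIRST_ROW))  # one-letter form of Met Thr Gln Glu ...
    print(translate(LAST_ROW))   # ends in "*" for the tga stop codon
    # 546 nucleotides = 182 codons = 181 residues plus one stop codon,
    # which agrees with the <211> value of 181 given for SEQ ID No. 2.
    print(546 // 3 - 1)
```

The same function can be applied row by row to the other coding sequences in the listing to confirm that each codon line agrees with the amino acid line printed beneath it.
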



<210> 3 
<211> 1155 
<212> DNA 
<213> Bacteria 

<220> 
<221> CDS 
<222> (1)...(1155) 

<400> 3 

atg gca act agc gag agg ggt gtc atg atc ccg ctg tcc aag gtc gcc 48 
Met Ala Thr Ser Glu Arg Gly Val Met Ile Pro Leu Ser Lys Val Ala 
1 5 10 15 

atg tct ccg gac gtc agc acc cgc gtc tcc gcc gtc ctg agc agt ggc 96 
Met Ser Pro Asp Val Ser Thr Arg Val Ser Ala Val Leu Ser Ser Gly 
20 25 30 

cgg ctg gag cac ggg ccg acc gtc gcc gag tac gag gcg gcc gtg ggc 144 
Arg Leu Glu His Gly Pro Thr Val Ala Glu Tyr Glu Ala Ala Val Gly 
35 40 45 

agt cgt atc ggc aac ccc cgg gtg gtc tcg gtc aac tgc ggc acg gcc 192 
Ser Arg Ile Gly Asn Pro Arg Val Val Ser Val Asn Cys Gly Thr Ala 
50 55 60 

ggg ctc cac ctg gcg ctg agc ctc gcc gcg cgg ccg ggg gcc ggc gag 240 
Gly Leu His Leu Ala Leu Ser Leu Ala Ala Arg Pro Gly Ala Gly Glu 
65 70 75 80 

tcg gag cac gac ggc ccg ggc gag gtg ctc acc acg ccg ctg acc ttc 288 
Ser Glu His Asp Gly Pro Gly Glu Val Leu Thr Thr Pro Leu Thr Phe 
85 90 95 

gag ggc acg aac tgg ccg atc ctc gcc aac ggg ctg cgc atc cgg tgg 336 
Glu Gly Thr Asn Trp Pro Ile Leu Ala Asn Gly Leu Arg Ile Arg Trp 
100 105 110 

gtg gac gtc gac ccg gcc acc ctc aac atg gac ctc gac gac ctg gcc 384 
Val Asp Val Asp Pro Ala Thr Leu Asn Met Asp Leu Asp Asp Leu Ala 
115 120 125 

gcg aag atc tcg ccc gcc acc cgg gcc atc gtg gtg gtc cac tgg ctc 432 
Ala Lys Ile Ser Pro Ala Thr Arg Ala Ile Val Val Val His Trp Leu 
130 135 140 

ggc tac ccg gtg gac ctc aac cgg ctg cgc gcc gtc gtg gac cgg gcc 480 
Gly Tyr Pro Val Asp Leu Asn Arg Leu Arg Ala Val Val Asp Arg Ala 
145 150 155 160 

acg gcg gga tac gac cgc cgc ccg ctg gtc gtg gag gac tgc gcg cag 528 
Thr Ala Gly Tyr Asp Arg Arg Pro Leu Val Val Glu Asp Cys Ala Gln 
165 170 175 

gcg tgg ggc gcc acc tac cgg ggc gcg ccg ctg ggc acg cac ggc aac 576 
Ala Trp Gly Ala Thr Tyr Arg Gly Ala Pro Leu Gly Thr His Gly Asn 
180 185 190 

gtc tgc gtg tac agc acc ggc gcg atc aag atc ctg acg acc ggc agc 624 
Val Cys Val Tyr Ser Thr Gly Ala Ile Lys Ile Leu Thr Thr Gly Ser 
195 200 205 

ggc ggc ttc gtc gtg ctg ccc gac gac gac ctg tac gac cgg ctc cgg 672 
Gly Gly Phe Val Val Leu Pro Asp Asp Asp Leu Tyr Asp Arg Leu Arg 
210 215 220 

ctg cgc cgc tgg ctc ggc atc gag cgg gcg tcg gac cgg atc acc ggc 720 
Leu Arg Arg Trp Leu Gly Ile Glu Arg Ala Ser Asp Arg Ile Thr Gly 
225 230 235 240 

gac tac gac gtc gcc gag tgg ggc tac cgg ttc atc ctc aac gag atc 768 
Asp Tyr Asp Val Ala Glu Trp Gly Tyr Arg Phe Ile Leu Asn Glu Ile 
245 250 255 

ggc ggg gcg atc ggc ctg tcc aac ctg gaa cgc gtc gac gag ctg ctg 816 
Gly Gly Ala Ile Gly Leu Ser Asn Leu Glu Arg Val Asp Glu Leu Leu 
260 265 270 

cgc cgg cac cgg gag aac gcc gcg ttc tac gac aag gaa ctg gcc ggc 864 
Arg Arg His Arg Glu Asn Ala Ala Phe Tyr Asp Lys Glu Leu Ala Gly 
275 280 285 

atc gac ggc gtc gag cag acc gag cgg gcc gac gac cgg gag ccc gcg 912 
Ile Asp Gly Val Glu Gln Thr Glu Arg Ala Asp Asp Arg Glu Pro Ala 
290 295 300 

ttc tgg atg tac ccg ctg aag gtc cgc gac cgt ccc gcc ttc atg cgc 960 
Phe Trp Met Tyr Pro Leu Lys Val Arg Asp Arg Pro Ala Phe Met Arg 
305 310 315 320 

cgg ctg ctc gac gcc ggc atc gcc acc agc gtc gtg tcg cgc cgc aac 1008 
Arg Leu Leu Asp Ala Gly Ile Ala Thr Ser Val Val Ser Arg Arg Asn 
325 330 335 

gac gcg cac agc tgc gtc gcg tcg gcc cgc acc acc ctg ccc ggg ctg 1056 
Asp Ala His Ser Cys Val Ala Ser Ala Arg Thr Thr Leu Pro Gly Leu 
340 345 350 

gac cgg gtg gcg gac cgc gtg gtc cac atc ccg gtg ggc tgg tgg ctc 1104 
Asp Arg Val Ala Asp Arg Val Val His Ile Pro Val Gly Trp Trp Leu 
355 360 365 

acc gag gac gac cgc tcc cac gtc gtc gaa acg atc aag tcc ggc tgg 1152 
Thr Glu Asp Asp Arg Ser His Val Val Glu Thr Ile Lys Ser Gly Trp 
370 375 380 

tga 1155 
* 



<210> 4 
<211> 384 
<212> PRT 
<213> Bacteria 

<400> 4 

Met Ala Thr Ser Glu Arg Gly Val Met Ile Pro Leu Ser Lys Val Ala 
1 5 10 15 

Met Ser Pro Asp Val Ser Thr Arg Val Ser Ala Val Leu Ser Ser Gly 
20 25 30 

Arg Leu Glu His Gly Pro Thr Val Ala Glu Tyr Glu Ala Ala Val Gly 
35 40 45 

Ser Arg Ile Gly Asn Pro Arg Val Val Ser Val Asn Cys Gly Thr Ala 
50 55 60 

Gly Leu His Leu Ala Leu Ser Leu Ala Ala Arg Pro Gly Ala Gly Glu 
65 70 75 80 

Ser Glu His Asp Gly Pro Gly Glu Val Leu Thr Thr Pro Leu Thr Phe 
85 90 95 

Glu Gly Thr Asn Trp Pro Ile Leu Ala Asn Gly Leu Arg Ile Arg Trp 
100 105 110 

Val Asp Val Asp Pro Ala Thr Leu Asn Met Asp Leu Asp Asp Leu Ala 
115 120 125 

Ala Lys Ile Ser Pro Ala Thr Arg Ala Ile Val Val Val His Trp Leu 
130 135 140 

Gly Tyr Pro Val Asp Leu Asn Arg Leu Arg Ala Val Val Asp Arg Ala 
145 150 155 160 

Thr Ala Gly Tyr Asp Arg Arg Pro Leu Val Val Glu Asp Cys Ala Gln 
165 170 175 

Ala Trp Gly Ala Thr Tyr Arg Gly Ala Pro Leu Gly Thr His Gly Asn 
180 185 190 

Val Cys Val Tyr Ser Thr Gly Ala Ile Lys Ile Leu Thr Thr Gly Ser 
195 200 205 

Gly Gly Phe Val Val Leu Pro Asp Asp Asp Leu Tyr Asp Arg Leu Arg 
210 215 220 

Leu Arg Arg Trp Leu Gly Ile Glu Arg Ala Ser Asp Arg Ile Thr Gly 
225 230 235 240 

Asp Tyr Asp Val Ala Glu Trp Gly Tyr Arg Phe Ile Leu Asn Glu Ile 
245 250 255 

Gly Gly Ala Ile Gly Leu Ser Asn Leu Glu Arg Val Asp Glu Leu Leu 
260 265 270 

Arg Arg His Arg Glu Asn Ala Ala Phe Tyr Asp Lys Glu Leu Ala Gly 
275 280 285 

Ile Asp Gly Val Glu Gln Thr Glu Arg Ala Asp Asp Arg Glu Pro Ala 
290 295 300 

Phe Trp Met Tyr Pro Leu Lys Val Arg Asp Arg Pro Ala Phe Met Arg 
305 310 315 320 

Arg Leu Leu Asp Ala Gly Ile Ala Thr Ser Val Val Ser Arg Arg Asn 
325 330 335 

Asp Ala His Ser Cys Val Ala Ser Ala Arg Thr Thr Leu Pro Gly Leu 
340 345 350 

Asp Arg Val Ala Asp Arg Val Val His Ile Pro Val Gly Trp Trp Leu 
355 360 365 

Thr Glu Asp Asp Arg Ser His Val Val Glu Thr Ile Lys Ser Gly Trp 
370 375 380 



<210> 5 
<211> 990 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1)...(990) 

<400> 5 

gtg ccc aga tcc ctg gtc acc ggc ggc ttc ggc ttc gtc ggc agt cac 48 
Val Pro Arg Ser Leu Val Thr Gly Gly Phe Gly Phe Val Gly Ser His 
1 5 10 15 

gtc gtc gaa cgg ctg gtc cgc cgg ggt gac gag gtc gtc gtc tac gac 96 
Val Val Glu Arg Leu Val Arg Arg Gly Asp Glu Val Val Val Tyr Asp 
20 25 30 

ctc gcc gac ccg ccg ccc gac ctg gag cac ccg ccg ggc gcg atc cgg 144 
Leu Ala Asp Pro Pro Pro Asp Leu Glu His Pro Pro Gly Ala Ile Arg 
35 40 45 

cac gtc cgc ggc gac gtc cgg gac gcc gac ggg ctg gcg gcc gcc gcc 192 
His Val Arg Gly Asp Val Arg Asp Ala Asp Gly Leu Ala Ala Ala Ala 
50 55 60 

acc ggc gtg gac gag gtc tac cac ctc gcg gcg gtc gtc ggc gtc gac 240 
Thr Gly Val Asp Glu Val Tyr His Leu Ala Ala Val Val Gly Val Asp 
65 70 75 80 

cgg tac ctc agc cgg ccg ctg gac gtg gtc gag atc aac gtg gac ggc 288 
Arg Tyr Leu Ser Arg Pro Leu Asp Val Val Glu Ile Asn Val Asp Gly 
85 90 95 

acc cgg aac gcg ttg cgc gcc gca ctg cgc gcc ggt gcc cgg gtc gtg 336 
Thr Arg Asn Ala Leu Arg Ala Ala Leu Arg Ala Gly Ala Arg Val Val 
100 105 110 

gtg tcc agc acc agc gag gtg tac ggg cgc aat ccg cgg gtg ccg tgg 384 
Val Ser Ser Thr Ser Glu Val Tyr Gly Arg Asn Pro Arg Val Pro Trp 
115 120 125 

cgg gag gac gac gac cgg gtg ctc ggc agc acg gcg acg gac cgg tgg 432 
Arg Glu Asp Asp Asp Arg Val Leu Gly Ser Thr Ala Thr Asp Arg Trp 
130 135 140 

tcg tac tcg acg agc aag gcg gcg gcc gag cac ctg gcc ttc gcc ttc 480 
Ser Tyr Ser Thr Ser Lys Ala Ala Ala Glu His Leu Ala Phe Ala Phe 
145 150 155 160 

cac cgg cag gag ggc ctg ccg gtg acg gtg ctg cgg tac ttc aac gtc 528 
His Arg Gln Glu Gly Leu Pro Val Thr Val Leu Arg Tyr Phe Asn Val 
165 170 175 

tac ggc cca cgc cag cgc ccg gcg tac gtc ctc agc cgc acc gtc gcc 576 
Tyr Gly Pro Arg Gln Arg Pro Ala Tyr Val Leu Ser Arg Thr Val Ala 
180 185 190 

cgc ctg ctg cgg ggc gtt ccg ccc gtg gtg tac gac gac ggc cgc cag 624 
Arg Leu Leu Arg Gly Val Pro Pro Val Val Tyr Asp Asp Gly Arg Gln 
195 200 205 

acg cgg tgc ttc acc tgg atc gac gag gcg gcc gag gcg acc ctg ctg 672 
Thr Arg Cys Phe Thr Trp Ile Asp Glu Ala Ala Glu Ala Thr Leu Leu 
210 215 220 

gcc gcc gcc cac ccg cgg gcc gtc ggc gag tgt ttc aac atc ggc agc 720 
Ala Ala Ala His Pro Arg Ala Val Gly Glu Cys Phe Asn Ile Gly Ser 
225 230 235 240 

agc gtg gag acc acc gtc gcc gag gcg gtc cgg ctg gcc ggc acg gtg 768 
Ser Val Glu Thr Thr Val Ala Glu Ala Val Arg Leu Ala Gly Thr Val 
245 250 255 

gcc ggg gtg ccg gtg gcg gcc cag acc gcg gac acc gga gcc ggg ctc 816 
Ala Gly Val Pro Val Ala Ala Gln Thr Ala Asp Thr Gly Ala Gly Leu 
260 265 270 

ggc gcc cgc tac cag gac att ccc cgc cgc gta ccg gac tgc ggc aag 864 
Gly Ala Arg Tyr Gln Asp Ile Pro Arg Arg Val Pro Asp Cys Gly Lys 
275 280 285 

gcc gcc gcg ctg ctg gac tgg cgg gcc cgg gtg ccg ctg gtg acc ggc 912 
Ala Ala Ala Leu Leu Asp Trp Arg Ala Arg Val Pro Leu Val Thr Gly 
290 295 300 

ctg cgc cgg acc gtc gag tgg gcc cgc cgc aac ccg tgg tgg acc gcc 960 
Leu Arg Arg Thr Val Glu Trp Ala Arg Arg Asn Pro Trp Trp Thr Ala 
305 310 315 320 

cag gcc gac gac gga ctg gtc gtc agg tag 990 
Gln Ala Asp Asp Gly Leu Val Val Arg * 
325 



<210> 6 
<211> 329 
<212> PRT 
<213> Bacteria 

<400> 6 

Met Pro Arg Ser Leu Val Thr Gly Gly Phe Gly Phe Val Gly Ser His 
1 5 10 15 

Val Val Glu Arg Leu Val Arg Arg Gly Asp Glu Val Val Val Tyr Asp 
20 25 30 

Leu Ala Asp Pro Pro Pro Asp Leu Glu His Pro Pro Gly Ala Ile Arg 
35 40 45 

His Val Arg Gly Asp Val Arg Asp Ala Asp Gly Leu Ala Ala Ala Ala 
50 55 60 

Thr Gly Val Asp Glu Val Tyr His Leu Ala Ala Val Val Gly Val Asp 
65 70 75 80 

Arg Tyr Leu Ser Arg Pro Leu Asp Val Val Glu Ile Asn Val Asp Gly 
85 90 95 

Thr Arg Asn Ala Leu Arg Ala Ala Leu Arg Ala Gly Ala Arg Val Val 
100 105 110 

Val Ser Ser Thr Ser Glu Val Tyr Gly Arg Asn Pro Arg Val Pro Trp 
115 120 125 

Arg Glu Asp Asp Asp Arg Val Leu Gly Ser Thr Ala Thr Asp Arg Trp 
130 135 140 

Ser Tyr Ser Thr Ser Lys Ala Ala Ala Glu His Leu Ala Phe Ala Phe 
145 150 155 160 

His Arg Gln Glu Gly Leu Pro Val Thr Val Leu Arg Tyr Phe Asn Val 
165 170 175 

Tyr Gly Pro Arg Gln Arg Pro Ala Tyr Val Leu Ser Arg Thr Val Ala 
180 185 190 

Arg Leu Leu Arg Gly Val Pro Pro Val Val Tyr Asp Asp Gly Arg Gln 
195 200 205 

Thr Arg Cys Phe Thr Trp Ile Asp Glu Ala Ala Glu Ala Thr Leu Leu 
210 215 220 

Ala Ala Ala His Pro Arg Ala Val Gly Glu Cys Phe Asn Ile Gly Ser 
225 230 235 240 

Ser Val Glu Thr Thr Val Ala Glu Ala Val Arg Leu Ala Gly Thr Val 
245 250 255 

Ala Gly Val Pro Val Ala Ala Gln Thr Ala Asp Thr Gly Ala Gly Leu 
260 265 270 

Gly Ala Arg Tyr Gln Asp Ile Pro Arg Arg Val Pro Asp Cys Gly Lys 
275 280 285 

Ala Ala Ala Leu Leu Asp Trp Arg Ala Arg Val Pro Leu Val Thr Gly 
290 295 300 

Leu Arg Arg Thr Val Glu Trp Ala Arg Arg Asn Pro Trp Trp Thr Ala 
305 310 315 320 

Gln Ala Asp Asp Gly Leu Val Val Arg 
325 



<210> 7 
<211> 987 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1)...(987) 

<400> 7 

atg aca acc aat ccg gcc ttg gcc atc gag acc cgc gat ctg gtg aag 48 
Met Thr Thr Asn Pro Ala Leu Ala Ile Glu Thr Arg Asp Leu Val Lys 
1 5 10 15 

gtc ttc ggc cag acg cgc gcg gtc gac gga ctg gac ctg gtg gtg cgg 96 
Val Phe Gly Gln Thr Arg Ala Val Asp Gly Leu Asp Leu Val Val Arg 
20 25 30 

gcc ggg acg atc cac ggg gtg ctg ggc ccg aac ggc gcc ggc aag acg 144 
Ala Gly Thr Ile His Gly Val Leu Gly Pro Asn Gly Ala Gly Lys Thr 
35 40 45 

acg gcc atc aag atg ctc gcc acg ctg atg cga ccc acc tcc ggc acc 192 
Thr Ala Ile Lys Met Leu Ala Thr Leu Met Arg Pro Thr Ser Gly Thr 
50 55 60 

gcg tcc gtg ctg ggg cac gac gtg gtc cgc gag gcc gcc gag gtc cgg 240 
Ala Ser Val Leu Gly His Asp Val Val Arg Glu Ala Ala Glu Val Arg 
65 70 75 80 

cgc cgc atc ggc ctc acc ggc cag acc atg tcc gtc gac gag gac atg 288 
Arg Arg Ile Gly Leu Thr Gly Gln Thr Met Ser Val Asp Glu Asp Met 
85 90 95 

acc ggc gtg cag aac ctg atc ctc gcc ggc cgc ctg cag ggt ctg cgg 336 
Thr Gly Val Gln Asn Leu Ile Leu Ala Gly Arg Leu Gln Gly Leu Arg 
100 105 110 

cac gcg tcc gcg gcc gcg cgg gcg gag cag ttg atg gag gcg ttc gac 384 
His Ala Ser Ala Ala Ala Arg Ala Glu Gln Leu Met Glu Ala Phe Asp 
115 120 125 

ctc acc gag gtc ggc ggc cgg ctg gtg aag acc ttc tcc ggc ggg cag 432 
Leu Thr Glu Val Gly Gly Arg Leu Val Lys Thr Phe Ser Gly Gly Gln 
130 135 140 

cgg cgg cgc atc gac gtg gcc gcg agc atg gtg gtc acc ccc gag ctg 480 
Arg Arg Arg Ile Asp Val Ala Ala Ser Met Val Val Thr Pro Glu Leu 
145 150 155 160 

ctg ttc ctc gac gag ccg acc acc ggc ctc gac ccg cgc agc cgc agc 528 
Leu Phe Leu Asp Glu Pro Thr Thr Gly Leu Asp Pro Arg Ser Arg Ser 
165 170 175 

gag gtc tgg gag atg atc cgg gcg ctg gtc cgg gac ggg ggc acc gtc 576 
Glu Val Trp Glu Met Ile Arg Ala Leu Val Arg Asp Gly Gly Thr Val 
180 185 190 

ctg ctg acc acg cag tac ctc gac gag gcg gac cac ctc gcc gac gag 624 
Leu Leu Thr Thr Gln Tyr Leu Asp Glu Ala Asp His Leu Ala Asp Glu 
195 200 205 

ctg acg ctc atc gac cac ggc cgc atc gtg gcg cag ggc acc ccg ccg 672 
Leu Thr Leu Ile Asp His Gly Arg Ile Val Ala Gln Gly Thr Pro Pro 
210 215 220 

gag ctg aag gcg agc cgc gcc gcc ggc gtg ctc gac gtg cgg ctg cgt 720 
Glu Leu Lys Ala Ser Arg Ala Ala Gly Val Leu Asp Val Arg Leu Arg 
225 230 235 240 

gac ccc gag cgc cgg gcc gac gcg ggc gcc ctg ctc gcc aag gcc gtc 768 
Asp Pro Glu Arg Arg Ala Asp Ala Gly Ala Leu Leu Ala Lys Ala Val 
245 250 255 

ggc gcc gcc gcc gac ctc gac tcc gat ccg gcg cgg ctg tcg gtg cgg 816 
Gly Ala Ala Ala Asp Leu Asp Ser Asp Pro Ala Arg Leu Ser Val Arg 
260 265 270 

gtg acc gac ccc gac cgg gcg gcg ctg gcc ctg ... gag ctg gcg cgg 864 
Val Thr Asp Pro Asp Arg Ala Ala Leu Ala Leu Gly Glu Leu Ala Arg 
275 280 285 

gcc ggc atc cac gtc gac gac ttc acg ctc ggc cag ccc tcg ctc gac 912 
Ala Gly Ile His Val Asp Asp Phe Thr Leu Gly Gln Pro Ser Leu Asp 
290 295 300 

acg gtg ttc ctc gcc ctc acc ggt cac tcg acg gtc gac gcc agc gaa 960 
Thr Val Phe Leu Ala Leu Thr Gly His Ser Thr Val Asp Ala Ser Glu 
305 310 315 320 

gaa gag gaa gca gag gta cgg gca tga 987 
Glu Glu Glu Ala Glu Val Arg Ala * 
325 

<210> 8 
<211> 328 
<212> PRT 
<213> Bacteria 

<400> 8 

Met Thr Thr Asn Pro Ala Leu Ala Ile Glu Thr Arg Asp Leu Val Lys 
1 5 10 15 

Val Phe Gly Gln Thr Arg Ala Val Asp Gly Leu Asp Leu Val Val Arg 
20 25 30 

Ala Gly Thr Ile His Gly Val Leu Gly Pro Asn Gly Ala Gly Lys Thr 
35 40 45 

Thr Ala Ile Lys Met Leu Ala Thr Leu Met Arg Pro Thr Ser Gly Thr 
50 55 60 

Ala Ser Val Leu Gly His Asp Val Val Arg Glu Ala Ala Glu Val Arg 
65 70 75 80 

Arg Arg Ile Gly Leu Thr Gly Gln Thr Met Ser Val Asp Glu Asp Met 
85 90 95 

Thr Gly Val Gln Asn Leu Ile Leu Ala Gly Arg Leu Gln Gly Leu Arg 
100 105 110 

His Ala Ser Ala Ala Ala Arg Ala Glu Gln Leu Met Glu Ala Phe Asp 
115 120 125 

Leu Thr Glu Val Gly Gly Arg Leu Val Lys Thr Phe Ser Gly Gly Gln 
130 135 140 

Arg Arg Arg Ile Asp Val Ala Ala Ser Met Val Val Thr Pro Glu Leu 
145 150 155 160 

Leu Phe Leu Asp Glu Pro Thr Thr Gly Leu Asp Pro Arg Ser Arg Ser 
165 170 175 

Glu Val Trp Glu Met Ile Arg Ala Leu Val Arg Asp Gly Gly Thr Val 
180 185 190 

Leu Leu Thr Thr Gln Tyr Leu Asp Glu Ala Asp His Leu Ala Asp Glu 
195 200 205 

Leu Thr Leu Ile Asp His Gly Arg Ile Val Ala Gln Gly Thr Pro Pro 
210 215 220 

Glu Leu Lys Ala Ser Arg Ala Ala Gly Val Leu Asp Val Arg Leu Arg 
225 230 235 240 

Asp Pro Glu Arg Arg Ala Asp Ala Gly Ala Leu Leu Ala Lys Ala Val 
245 250 255 

Gly Ala Ala Ala Asp Leu Asp Ser Asp Pro Ala Arg Leu Ser Val Arg 
260 265 270 

Val Thr Asp Pro Asp Arg Ala Ala Leu Ala Leu Gly Glu Leu Ala Arg 
275 280 285 

Ala Gly Ile His Val Asp Asp Phe Thr Leu Gly Gln Pro Ser Leu Asp 
290 295 300 

Thr Val Phe Leu Ala Leu Thr Gly His Ser Thr Val Asp Ala Ser Glu 
305 310 315 320 

Glu Glu Glu Ala Glu Val Arg Ala 
325 



<210> 9 
<211> 1686 
<212> DNA 
<213> Bacteria 

<220> 

<221> misc_feature 
<222> (1)...(1686) 
<223> n = A,T,C or G 

<221> misc_feature 
<222> (1)...(1686) 
<223> n = A,T,C or G 

<221> CDS 
<222> (1)...(1686) 

<400> 9 

atg acg aca ccc agc acc gag gtg cgg ccg ctg ccg gcc gag atc ttc 48 
Met Thr Thr Pro Ser Thr Glu Val Arg Pro Leu Pro Ala Glu Ile Phe 
1 5 10 15 

agc cga tcg gtg gcc ggc gcg gaa cgg ccg cca cgc ccc ggc ccg ctg 96 
Ser Arg Ser Val Ala Gly Ala Glu Arg Pro Pro Arg Pro Gly Pro Leu 
20 25 30 

ttc gcc gtc cgc acc ttc gcc tgg cgg aac ctg atc aag ctc cgg tac 144 
Phe Ala Val Arg Thr Phe Ala Trp Arg Asn Leu Ile Lys Leu Arg Tyr 
35 40 45 

gtg cag gac cac ctg ggc acc gcg gtg gtc ttc ccg atc atc ctg acg 192 
Val Gln Asp His Leu Gly Thr Ala Val Val Phe Pro Ile Ile Leu Thr 
50 55 60 

ctg gtc ttc acc tat ctg ctc ggc ggc gcg atc gcc ggc tcg ccc cgg 240 
Leu Val Phe Thr Tyr Leu Leu Gly Gly Ala Ile Ala Gly Ser Pro Arg 
65 70 75 80 

gag tac ctg cag ttc ttc ctt ccc ggc gtg atc gtc ctc tcg ctc gtg 288 
Glu Tyr Leu Gln Phe Phe Leu Pro Gly Val Ile Val Leu Ser Leu Val 
85 90 95 

tcg tcg agc atg atg agc gcc ctg acg ctg aac cgg gac atc gcc acc 336 
Ser Ser Ser Met Met Ser Ala Leu Thr Leu Asn Arg Asp Ile Ala Thr 
100 105 110 

ggc atg ttc gac cgg gtc cgc agc acg ccc atc tgg cag ccc gcg gta 384 
Gly Met Phe Asp Arg Val Arg Ser Thr Pro Ile Trp Gln Pro Ala Val 
115 120 125 

ctg gtc ggg gcg atg gcc ggc gac gcc gtc cgg tac gcc ctg acc tcg 432 
Leu Val Gly Ala Met Ala Gly Asp Ala Val Arg Tyr Ala Leu Thr Ser 
130 135 140 

atc gtg ccg ctg tcg ctc ggc ctg ctg ctc ggc ttc cgg ccg gac ggc 480 
Ile Val Pro Leu Ser Leu Gly Leu Leu Leu Gly Phe Arg Pro Asp Gly 
145 150 155 160 

ggc ctg tcc ggg gtg gtg ctc gcc ctg ctc tac ctg cag ctg ttc acc 528 
Gly Leu Ser Gly Val Val Leu Ala Leu Leu Tyr Leu Gln Leu Phe Thr 
165 170 175 

ttc agc gtc gcc tgg ctg tgg atg ctg ttc gcg gtg ctg atc ccg cag 576 
Phe Ser Val Ala Trp Leu Trp Met Leu Phe Ala Val Leu Ile Pro Gln 
180 185 190 

ccg acc gcc gcc gcc ggc gtg gtg aac ctc ctg cag ttc gtg ctc ctc 624 
Pro Thr Ala Ala Ala Gly Val Val Asn Leu Leu Gln Phe Val Leu Leu 
195 200 205 

ttc ggc agc aac atc ctg gcg ccg tcg cag acg atg ccg ggc tgg ctg 672 
Phe Gly Ser Asn Ile Leu Ala Pro Ser Gln Thr Met Pro Gly Trp Leu 
210 215 220 

gag gcg gtg gtc aag ttg aac ccc gtc acc cac gcc gcg acc gcc acc 720 
Glu Ala Val Val Lys Leu Asn Pro Val Thr His Ala Ala Thr Ala Thr 
225 230 235 240 

cgc ggg ctg ntg cac ggc acg gtg acc tcg ggg gag atg ggc gcn ggc 768 
Arg Gly Leu Xaa His Gly Thr Val Thr Ser Gly Glu Met Gly Ala Gly 
245 250 255 

ctg ctg acc tgc gcc gtg ctc atc gtg gct gct cgc ccc gcc cac gat 816 
Leu Leu Thr Cys Ala Val Leu Ile Val Ala Ala Arg Pro Ala His Asp 
260 265 270 

ctg gct cta cag ccg caa gca gcg ctg aca ccc ctc ccc gac ggc ccc 864 
Leu Ala Leu Gln Pro Gln Ala Ala Leu Thr Pro Leu Pro Asp Gly Pro 
275 280 285 

ggt gtg ccc cct gtt ctc ctc gca ggg gca ggc ccg ggg ccg tcg cgg 912 
Gly Val Pro Pro Val Leu Leu Ala Gly Ala Gly Pro Gly Pro Ser Arg 
290 295 300 

cat ccc gcc gcc ggt cgg cgc tgt gcc ccg gcc gca ccc gga gcc ttt 960 
His Pro Ala Ala Gly Arg Arg Cys Ala Pro Ala Ala Pro Gly Ala Phe 
305 310 315 320 

gcc gcg ccg gcc acc gct gct gcg gcc gta acc gcc cgc tgt gtc ggt 1008 
Ala Ala Pro Ala Thr Ala Ala Ala Ala Val Thr Ala Arg Cys Val Gly 
325 330 335 

cac cgg cgc cgt ggc ggc gca ccg tgt cgg ggc cgg ctg ccc act tgt 1056 
His Arg Arg Arg Gly Gly Ala Pro Cys Arg Gly Arg Leu Pro Thr Cys 
340 345 350 

ggc cgc cgt gcg gtc ggc gga cgg acg gcg gcc ccg gac gga cat gag 1104 
Gly Arg Arg Ala Val Gly Gly Arg Thr Ala Ala Pro Asp Gly His Glu 
355 360 365 

tcc gtc cgg ggc cgc gtc gtg gtc ggt cga gca gct ccc gac cgg cta 1152 
Ser Val Arg Gly Arg Val Val Val Gly Arg Ala Ala Pro Asp Arg Leu 
370 375 380 

cga gcc gga gtg gac cag cgc ggc cca ggc ctc gcc gcg ctc ctg cga 1200 
Arg Ala Gly Val Asp Gln Arg Gly Pro Gly Leu Ala Ala Leu Leu Arg 
385 390 395 400 

gat ccg cat cat ctc ggg gct cgg ctc gaa ctc ctc ggc cgc gtc ctc 1248 
Asp Pro His His Leu Gly Ala Arg Leu Glu Leu Leu Gly Arg Val Leu 
405 410 415 

cgg gac cgg gcc gcc cgg cag gtc gcc ccg gat gaa cat acc gag gaa 1296 
Arg Asp Arg Ala Ala Arg Gln Val Ala Pro Asp Glu His Thr Glu Glu 
420 425 430 

gtc gag cgc cat ctc cca gcc gac gcc gac ctc cac gag cat ctg ctc 1344 
Val Glu Arg His Leu Pro Ala Asp Ala Asp Leu His Glu His Leu Leu 
435 440 445 

cga ggt cgt ggc gtg ctc cag ctc cag cag ggt gcc gtc gcc ctc ctc 1392 
Arg Gly Arg Gly Val Leu Gln Leu Gln Gln Gly Ala Val Ala Leu Leu 
450 455 460 

gtc agc cgc agc tcc acc tcg ctg tcc ggc ttg ccc tcg tac acc cag 1440 
Val Ser Arg Ser Ser Thr Ser Leu Ser Gly Leu Pro Ser Tyr Thr Gln 
465 470 475 480 

ctg atg gtg agc cgg cgc ggt ggc tcg cag cgc agg atg tcg ccg ctg 1488 
Leu Met Val Ser Arg Arg Gly Gly Ser Gln Arg Arg Met Ser Pro Leu 
485 490 495 

gcg ttg ccc tgc agg gcg aag ttg cca ccc tcg cgg agg tcg ccc ttg 1536 
Ala Leu Pro Cys Arg Ala Lys Leu Pro Pro Ser Arg Arg Ser Pro Leu 
500 505 510 

ggc tcg atg aac cag cgg ttg atg cgg ttc ggg tcg gtg cag gcg ctc 1584 
Gly Ser Met Asn Gln Arg Leu Met Arg Phe Gly Ser Val Gln Ala Leu 
515 520 525 

cag acc tcg tcg acg ggc gcg tcg tac cgt tgc cgg atg atg atg ctg 1632 
Gln Thr Ser Ser Thr Gly Ala Ser Tyr Arg Cys Arg Met Met Met Leu 
530 535 540 

cgg gcc tcg ccg gcc ggg atg gtg cgc cgg ccg agg gca cgc tcc gtc 1680 
Arg Ala Ser Pro Ala Gly Met Val Arg Arg Pro Arg Ala Arg Ser Val 
545 550 555 560 

gcc tga 1686 
Ala * 



<210> 10 
<211> 561 
<212> PRT 
<213> Bacteria 



<220> 
<221> VARIANT 
<222> (1)...(561) 
<223> Xaa = Any Amino Acid 

<221> VARIANT 
<222> (1)...(561) 
<223> Xaa = Any Amino Acid 

<400> 10 

Met Thr Thr Pro Ser Thr Glu Val Arg Pro Leu Pro Ala Glu Ile Phe 
1 5 10 15 

Ser Arg Ser Val Ala Gly Ala Glu Arg Pro Pro Arg Pro Gly Pro Leu 
20 25 30 

Phe Ala Val Arg Thr Phe Ala Trp Arg Asn Leu Ile Lys Leu Arg Tyr 
35 40 45 

Val Gln Asp His Leu Gly Thr Ala Val Val Phe Pro Ile Ile Leu Thr 
50 55 60 

Leu Val Phe Thr Tyr Leu Leu Gly Gly Ala Ile Ala Gly Ser Pro Arg 
65 70 75 80 

Glu Tyr Leu Gln Phe Phe Leu Pro Gly Val Ile Val Leu Ser Leu Val 
85 90 95 

Ser Ser Ser Met Met Ser Ala Leu Thr Leu Asn Arg Asp Ile Ala Thr 
100 105 110 

Gly Met Phe Asp Arg Val Arg Ser Thr Pro Ile Trp Gln Pro Ala Val 
115 120 125 

Leu Val Gly Ala Met Ala Gly Asp Ala Val Arg Tyr Ala Leu Thr Ser 
130 135 140 

Ile Val Pro Leu Ser Leu Gly Leu Leu Leu Gly Phe Arg Pro Asp Gly 
145 150 155 160 

Gly Leu Ser Gly Val Val Leu Ala Leu Leu Tyr Leu Gln Leu Phe Thr 
165 170 175 

Phe Ser Val Ala Trp Leu Trp Met Leu Phe Ala Val Leu Ile Pro Gln 
180 185 190 

Pro Thr Ala Ala Ala Gly Val Val Asn Leu Leu Gln Phe Val Leu Leu 
195 200 205 

Phe Gly Ser Asn Ile Leu Ala Pro Ser Gln Thr Met Pro Gly Trp Leu 
210 215 220 

Glu Ala Val Val Lys Leu Asn Pro Val Thr His Ala Ala Thr Ala Thr 
225 230 235 240 

Arg Gly Leu Xaa His Gly Thr Val Thr Ser Gly Glu Met Gly Ala Gly 
245 250 255 

Leu Leu Thr Cys Ala Val Leu Ile Val Ala Ala Arg Pro Ala His Asp 
260 265 270 

Leu Ala Leu Gln Pro Gln Ala Ala Leu Thr Pro Leu Pro Asp Gly Pro 
275 280 285 

Gly Val Pro Pro Val Leu Leu Ala Gly Ala Gly Pro Gly Pro Ser Arg 
290 295 300 

His Pro Ala Ala Gly Arg Arg Cys Ala Pro Ala Ala Pro Gly Ala Phe 
305 310 315 320 

Ala Ala Pro Ala Thr Ala Ala Ala Ala Val Thr Ala Arg Cys Val Gly 
325 330 335 

His Arg Arg Arg Gly Gly Ala Pro Cys Arg Gly Arg Leu Pro Thr Cys 
340 345 350 

Gly Arg Arg Ala Val Gly Gly Arg Thr Ala Ala Pro Asp Gly His Glu 
355 360 365 

Ser Val Arg Gly Arg Val Val Val Gly Arg Ala Ala Pro Asp Arg Leu 
370 375 380 

Arg Ala Gly Val Asp Gln Arg Gly Pro Gly Leu Ala Ala Leu Leu Arg 
385 390 395 400 

Asp Pro His His Leu Gly Ala Arg Leu Glu Leu Leu Gly Arg Val Leu 
405 410 415 

Arg Asp Arg Ala Ala Arg Gln Val Ala Pro Asp Glu His Thr Glu Glu 
420 425 430 

Val Glu Arg His Leu Pro Ala Asp Ala Asp Leu His Glu His Leu Leu 
435 440 445 

Arg Gly Arg Gly Val Leu Gln Leu Gln Gln Gly Ala Val Ala Leu Leu 
450 455 460 

Val Ser Arg Ser Ser Thr Ser Leu Ser Gly Leu Pro Ser Tyr Thr Gln 
465 470 475 480 

Leu Met Val Ser Arg Arg Gly Gly Ser Gln Arg Arg Met Ser Pro Leu 
485 490 495 

Ala Leu Pro Cys Arg Ala Lys Leu Pro Pro Ser Arg Arg Ser Pro Leu 
500 505 510 

Gly Ser Met Asn Gln Arg Leu Met Arg Phe Gly Ser Val Gln Ala Leu 
515 520 525 

Gln Thr Ser Ser Thr Gly Ala Ser Tyr Arg Cys Arg Met Met Met Leu 
530 535 540 

Arg Ala Ser Pro Ala Gly Met Val Arg Arg Pro Arg Ala Arg Ser Val 
545 550 555 560 

Ala 



<210> 11 
<211> 792 
<212> DNA 
<213> Bacteria 

<220> 
<221> CDS 
<222> (1)...(792) 

<400> 11 

atg cga tgg agg ctg ... 48 
Met Arg Trp Arg Leu ... 
1 5 10 15 

gcg ttc gtg cac gac tca ccg gag gag acg gag acc acc cag cgc ctg 96 
Ala Phe Val His Asp Ser Pro Glu Glu Thr Glu Thr Thr Gln Arg Leu 
20 25 30 

acg aag ctc ttg acc aac tct ccg atc ccc acg gag gaa ctg gtc aac 144 
Thr Lys Leu Leu Thr Asn Ser Pro Ile Pro Thr Glu Glu Leu Val Asn 
35 40 45 

aac ctc ccc ctg ttc ctg cgc cgc cac cag atg acc gat ctg ctc tcg 192 
Asn Leu Pro Leu Phe Leu Arg Arg His Gln Met Thr Asp Leu Leu Ser 
50 55 60 

atg gac gcg ctc tac cgt cag gtc ctc gac gtg ccg ggc gtg atc atg 240 
Met Asp Ala Leu Tyr Arg Gln Val Leu Asp Val Pro Gly Val Ile Met 
65 70 75 80 

gag ttc ggc gtc cgg ttc ggc cgt cac ctc ggc acg ttc gcc gcc ctg 288 
Glu Phe Gly Val Arg Phe Gly Arg His Leu Gly Thr Phe Ala Ala Leu 
85 90 95 

cgc ggt gtc tac gag ccc tac aac ccg ctg cgc cgc atc gtc ggc ttc 336 
Arg Gly Val Tyr Glu Pro Tyr Asn Pro Leu Arg Arg Ile Val Gly Phe 
100 105 110 

gac acc ttc acc ggc ttc ccc gac gtc aac gac gtc gac cgc gtc ggc 384 
Asp Thr Phe Thr Gly Phe Pro Asp Val Asn Asp Val Asp Arg Val Gly 
115 120 125 

ccc acg gcg tac cag ggc cgc ttc gca gtg ccc ggg ggc tat ccg gcg 432 
Pro Thr Ala Tyr Gln Gly Arg Phe Ala Val Pro Gly Gly Tyr Pro Ala 
130 135 140 

tac ctg aaa gag gtg ctg gac gcg cac gag tgc agc gac ttc ttc ggc 480 
Tyr Leu Lys Glu Val Leu Asp Ala His Glu Cys Ser Asp Phe Phe Gly 
145 150 155 160 

cac gtg acg cag cgc agc gtg ctc gtc gag ggg gac gta cgg gag acg 528 
His Val Thr Gln Arg Ser Val Leu Val Glu Gly Asp Val Arg Glu Thr 
165 170 175 

gtg ccg cgc tac ctc gcg gag aac ccg cag acc gtc atc gcg ctg gcg 576 
Val Pro Arg Tyr Leu Ala Glu Asn Pro Gln Thr Val Ile Ala Leu Ala 
180 185 190 

tac ttc gac ctc gac ctc tac gag ccg acg aag gcc gtc ctg gag gcg 624 
Tyr Phe Asp Leu Asp Leu Tyr Glu Pro Thr Lys Ala Val Leu Glu Ala 
195 200 205 

atc cgc ccc tac ctc acc aag ggc agc atc gtc gcc ttc gac gaa ctc 672 
Ile Arg Pro Tyr Leu Thr Lys Gly Ser Ile Val Ala Phe Asp Glu Leu 
210 215 220 

gac aat ccg aag tgg ccc ggc gag aac atc gcg atg cgg aag gtg ctc 720 
Asp Asn Pro Lys Trp Pro Gly Glu Asn Ile Ala Met Arg Lys Val Leu 
225 230 235 240 

ggg ctg gac cac gcc ccg ctg cgc ctg ctg ccg ggc cgc ccg gcg ccg 768 
Gly Leu Asp His Ala Pro Leu Arg Leu Leu Pro Gly Arg Pro Ala Pro 
245 250 255 

gcg tac ctg cgg tgg ggc gac tga 792 
Ala Tyr Leu Arg Trp Gly Asp * 
260 
<210> 12 
<211> 263 
<212> PRT 
<213> Bacteria 

<400> 12 

Met Arg Trp Arg Leu Arg Met Asp Ser Gly Asp Gly Gln Asp Leu Arg 
1 5 10 15 

Ala Phe Val His Asp Ser Pro Glu Glu Thr Glu Thr Thr Gln Arg Leu 
20 25 30 

Thr Lys Leu Leu Thr Asn Ser Pro Ile Pro Thr Glu Glu Leu Val Asn 
35 40 45 

Asn Leu Pro Leu Phe Leu Arg Arg His Gln Met Thr Asp Leu Leu Ser 
50 55 60 

Met Asp Ala Leu Tyr Arg Gln Val Leu Asp Val Pro Gly Val Ile Met 
65 70 75 80 

Glu Phe Gly Val Arg Phe Gly Arg His Leu Gly Thr Phe Ala Ala Leu 
85 90 95 

Arg Gly Val Tyr Glu Pro Tyr Asn Pro Leu Arg Arg Ile Val Gly Phe 
100 105 110 

Asp Thr Phe Thr Gly Phe Pro Asp Val Asn Asp Val Asp Arg Val Gly 
115 120 125 

Pro Thr Ala Tyr Gln Gly Arg Phe Ala Val Pro Gly Gly Tyr Pro Ala 
130 135 140 

Tyr Leu Lys Glu Val Leu Asp Ala His Glu Cys Ser Asp Phe Phe Gly 
145 150 155 160 

His Val Thr Gln Arg Ser Val Leu Val Glu Gly Asp Val Arg Glu Thr 
165 170 175 

Val Pro Arg Tyr Leu Ala Glu Asn Pro Gln Thr Val Ile Ala Leu Ala 
180 185 190 

Tyr Phe Asp Leu Asp Leu Tyr Glu Pro Thr Lys Ala Val Leu Glu Ala 
195 200 205 

Ile Arg Pro Tyr Leu Thr Lys Gly Ser Ile Val Ala Phe Asp Glu Leu 
210 215 220 

Asp Asn Pro Lys Trp Pro Gly Glu Asn Ile Ala Met Arg Lys Val Leu 
225 230 235 240 

Gly Leu Asp His Ala Pro Leu Arg Leu Leu Pro Gly Arg Pro Ala Pro 
245 250 255 

Ala Tyr Leu Arg Trp Gly Asp 
260 



<210> 13 
<211> 738 
<212> DNA 
<213> Bacteria 



<220> 

<221> CDS 

<222> (1)...(738) 

<400> 13 

atg ttc gga ccg gag cac gcc gag gtg tac gag gcc gcc tac cgc ggc 48 
Met Phe Gly Pro Glu His Ala Glu Val Tyr Glu Ala Ala Tyr Arg Gly 
1 5 10 15 

cgc ggc aag agc tgg cac gac gag gcg gcg gac gtg gcc gac cgg atc 96 
Arg Gly Lys Ser Trp His Asp Glu Ala Ala Asp Val Ala Asp Arg Ile 
20 25 30 

cgg gcc gcc cgc ccc gac gcc gcc cgg ctg ctc gac gtc ggc tgc ggc 144 
Arg Ala Ala Arg Pro Asp Ala Ala Arg Leu Leu Asp Val Gly Cys Gly 
35 40 45 

acc ggc gcg cac ctc gag acc ttc gcg acc cgc ttc ccc cac gtg gag 192 
Thr Gly Ala His Leu Glu Thr Phe Ala Thr Arg Phe Pro His Val Glu 
50 55 60 

ggg ctc gaa ctg gcc ccg gcg atg ctg gcg ctc gcc cga cac cgg ctg 240 
Gly Leu Glu Leu Ala Pro Ala Met Leu Ala Leu Ala Arg His Arg Leu 
65 70 75 80 

ccc ggg gtg cgc ctg cac gcc ggg gac atg cgg acg ttc gac ctt ggc 288 
Pro Gly Val Arg Leu His Ala Gly Asp Met Arg Thr Phe Asp Leu Gly 
85 90 95 

gtc acg ttc gac gcg gtg acc tgc ctg ttc acc gcg gtc aac ttc ctc 336 
Val Thr Phe Asp Ala Val Thr Cys Leu Phe Thr Ala Val Asn Phe Leu 
100 105 110 

ggc acg gtg gcc gag atg cgg gcg gcc gtg gcc gcg atg tcg gcc cac 384 
Gly Thr Val Ala Glu Met Arg Ala Ala Val Ala Ala Met Ser Ala His 
115 120 125 

ctg gcg ccg ggc ggc gtg ctg gtg ctc gaa ccg tgg tgg ttc ccg gag 432 
Leu Ala Pro Gly Gly Val Leu Val Leu Glu Pro Trp Trp Phe Pro Glu 
130 135 140 

cgg ttc atc gac ggg tac gtc ggc ggc gac ctg gtg cgc gag gag ggc 480 
Arg Phe Ile Asp Gly Tyr Val Gly Gly Asp Leu Val Arg Glu Glu Gly 
145 150 155 160 

cgc acg gtg gcg cgg gtg tcg cgg tcc acc cgg cag gga cgg gtg acg 528 
Arg Thr Val Ala Arg Val Ser Arg Ser Thr Arg Gln Gly Arg Val Thr 
165 170 175 

cgg atg gag gag cgc tgg ctc gtc ggc gac gcc gcc ggg atc cgg gag 576 
Arg Met Glu Glu Arg Trp Leu Val Gly Asp Ala Ala Gly Ile Arg Glu 
180 185 190 

ttc agc cag gtc ggc ctg ctc acc atg ttc acc cgc gag gag tac gac 624 
Phe Ser Gln Val Gly Leu Leu Thr Met Phe Thr Arg Glu Glu Tyr Asp 
195 200 205 

gcg gcg ttc gcc gct gcc ggc tgc gag tcc gcg tac gtc gag ggc tgg 672 
Ala Ala Phe Ala Ala Ala Gly Cys Glu Ser Ala Tyr Val Glu Gly Trp 
210 215 220 

ctg acc ggc cgg ggc ctt ttc gtg gcg acg cgt acc ggt gga cac gcc 720 
Leu Thr Gly Arg Gly Leu Phe Val Ala Thr Arg Thr Gly Gly His Ala 
225 230 235 240 

acc ccg aca atg gtt tga 738 
Thr Pro Thr Met Val * 
245 
<210> 14 
<211> 245 
<212> PRT 
<213> Bacteria 

<400> 14 

Met Phe Gly Pro Glu His Ala Glu Val Tyr Glu Ala Ala Tyr Arg Gly 
1 5 10 15 

Arg Gly Lys Ser Trp His Asp Glu Ala Ala Asp Val Ala Asp Arg Ile 
20 25 30 

Arg Ala Ala Arg Pro Asp Ala Ala Arg Leu Leu Asp Val Gly Cys Gly 
35 40 45 

Thr Gly Ala His Leu Glu Thr Phe Ala Thr Arg Phe Pro His Val Glu 
50 55 60 

Gly Leu Glu Leu Ala Pro Ala Met Leu Ala Leu Ala Arg His Arg Leu 
65 70 75 80 

Pro Gly Val Arg Leu His Ala Gly Asp Met Arg Thr Phe Asp Leu Gly 
85 90 95 

Val Thr Phe Asp Ala Val Thr Cys Leu Phe Thr Ala Val Asn Phe Leu 
100 105 110 

Gly Thr Val Ala Glu Met Arg Ala Ala Val Ala Ala Met Ser Ala His 
115 120 125 

Leu Ala Pro Gly Gly Val Leu Val Leu Glu Pro Trp Trp Phe Pro Glu 
130 135 140 

Arg Phe Ile Asp Gly Tyr Val Gly Gly Asp Leu Val Arg Glu Glu Gly 
145 150 155 160 

Arg Thr Val Ala Arg Val Ser Arg Ser Thr Arg Gln Gly Arg Val Thr 
165 170 175 

Arg Met Glu Glu Arg Trp Leu Val Gly Asp Ala Ala Gly Ile Arg Glu 
180 185 190 

Phe Ser Gln Val Gly Leu Leu Thr Met Phe Thr Arg Glu Glu Tyr Asp 
195 200 205 

Ala Ala Phe Ala Ala Ala Gly Cys Glu Ser Ala Tyr Val Glu Gly Trp 
210 215 220 

Leu Thr Gly Arg Gly Leu Phe Val Ala Thr Arg Thr Gly Gly His Ala 
225 230 235 240 

Thr Pro Thr Met Val 
245 



<210> 15 

<211> 1707 

<212> DNA 

<213> Bacteria 



<220> 

<221> CDS 

<222> (1) . . - (1707) 



<400> 15 

gtg ccg gac cac gac cag cag cct 
Val Pro Asp His Asp Gin Gin Pro 
1 5 

tac ggg ccc ggt ggc etc gac cac 
Tyr Gly Pro Gly Gly Leu Asp His 
20 



cgc cac ggc ggc acg ctg cgc tac 48 
Arg His Gly Gly Thr Leu Arg Tyr 
10 15 

ctg gac ccc gec gec gcg tac tac 96 
Leu Asp Pro Ala Ala Ala Tyr Tyr 
25 30 
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gcc ttc tec cac cag gtc ate egg etc ttc gee egg cag ctg ttc age 144 
Ala Phe Ser His Gin Val lie Arg Leu Phe Ala Arg Gin Leu Phe Ser 

35 40 45 

tac ccg acc acg gag gac gcc gcc gcg ctg gtg ccg gtg ccc gac gtg 192 

Tyr Pro Thr Thr Glu Asp Ala Ala Ala Leu Val Pro Val Pro Asp Val 

50 55 60 

gcc gcc gag ttg ccc acg gtg gac aat ggc ggg etc age gag gac ggc 24 0 
Ala Ala Glu Leu Pro Thr Val Asp Asn Gly Gly Leu Ser Glu Asp Gly 

65 70 75 80 

cgc acg tac acg ate cgc ctg cgc gac ggg gtc egg tgg gac acc gcc 288 
Arg Thr Tyr Thr lie Arg Leu Arg Asp Gly Val Arg Trp Asp Thr Ala 

85 90 95 

ccg ccg egg ccg gtg acc gcg ggg gac ttc gtg cgc ggc ttc aag egg 336 
Pro Pro Arg Pro Val Thr Ala Gly Asp Phe Val Arg Gly Phe Lys Arg 

100 105 110 

atg gcc aac ccg gtc gcc ggg gcc ggc gcc ate gcc tac tac acg age 3 84 

Met Ala Asn Pro Val Ala Gly Ala Gly Ala lie Ala Tyr Tyr Thr Ser 

115 120 125 

acc ate gcc ggc atg gcg gag ttc gcc gag ggc tac cgc gcg cgc ttc 432 

Thr He Ala Gly Met Ala Glu Phe Ala Glu Gly Tyr Arg Ala Arg Phe 

130 135 140 

gcc ggg cgt acg ccc acc gcc gcc gag ctg gcc gac tac cag aac ggc 4 80 

Ala Gly Arg Thr Pro Thr Ala Ala Glu Leu Ala Asp Tyr Gin Asn Gly 

145 150 155 160 

cac gag ate age ggg ctg tgg gcc aag gac gac egg acc ctg gtg ate 52 8 

His Glu He Ser Gly Leu Trp Ala Lys Asp Asp Arg Thr Leu Val He 

165 170 175 

gag ctg ctg cgc ccc gcc aac gac atg etc aac ctg ctg gcg atg ccg 5 76 

Glu Leu Leu Arg Pro Ala Asn Asp Met Leu Asn Leu Leu Ala Met Pro 

180 185 190 

ttc gcc tec gcc gcg ccc egg gag ttc gac gac etc gtc ccg gac ggt 624 

Phe Ala Ser Ala Ala Pro Arg Glu Phe Asp Asp Leu Val Pro Asp Gly 

195 200 205 

ccg gac ttc gcg egg ctg gtc cgc tec aac ggg ccg tac egg ate acc 672 

Pro Asp Phe Ala Arg Leu Val Arg Ser Asn Gly Pro Tyr Arg He Thr 

210 215 220 

ggc tac gee egg ggc age cac ctg acc atg gac cac aac ccc gcc tgg 720 

Gly Tyr Ala Arg Gly Ser His Leu Thr Met Asp His Asn Pro Ala Trp 

225 230 235 240 

c 99 9 CC £F ac 9 ca !? ac cc 9" atc C 9 C C 9 C c gc tac 9^9 9 ac c 9t atc 9^9 768 

Arg Ala Asp Ala Asp Pro He Arg Arg Arg Tyr Val Asp Arg He Glu 

245 250 255 

gtg egg atg gcg agg gtg age gac gag egg gtc cgc gcc gag atc gag 816 

Val Arg Met Ala Arg Val Ser Asp Glu Arg Val Arg Ala Glu He Glu 
260 265 270

age ggg gcg gec gac ctg teg tgg ggc gec gec gtg ggc agg ccc cgc 864 
Ser Gly Ala Ala Asp Leu Ser Trp Gly Ala Ala Val Gly Arg Pro Arg 
275 280 285 

egg cgt acg gcg gec gac egg aac etc ggc tgg gcg ctg aac ccc tac 912 
Arg Arg Thr Ala Ala Asp Arg Asn Leu Gly Trp Ala Leu Asn Pro Tyr 
290 295 300 

ctg gcg ttc aac ctg cac age ccg cac gag egg ggg gcg ctg cgc gac 96 0 
Leu Ala Phe Asn Leu His Ser Pro His Glu Arg Gly Ala Leu Arg Asp 
305 310 315 320 

egg acc gtc egg ctg gcg ate gcg tac gec gtc gac aag gcg egg etc 10 08 
Arg Thr Val Arg Leu Ala lie Ala Tyr Ala Val Asp Lys Ala Arg Leu 
325 330 335 

gtc egg ttc ttc gac gac atg aac ate ggc acg gtg acc egc ccc gcg 1056 
Val Arg Phe Phe Asp Asp Met Asn lie Gly Thr Val Thr Arg Pro Ala 
340 345 350 

cac acg gee ate ccg ccg ggc aac ttc ggc cac cgc gag tac gac ccg 1104 
His Thr Ala lie Pro Pro Gly Asn Phe Gly His Arg Glu Tyr Asp Pro 
355 360 365 

tac ccg acg ccg ggg gac egg ggc gac egg gcg cgc tgc egg gag ctg 1152 
Tyr Pro Thr Pro Gly Asp Arg Gly Asp Arg Ala Arg Cys Arg Glu Leu 
370 375 380 

etc gec gag gee ggg tac ccc gac ggg ctg egg etc acc atg ate tac 12 0 0 
Leu Ala Glu Ala Gly Tyr Pro Asp Gly Leu Arg Leu Thr Met lie Tyr 
385 390 395 400 

egg ate gac gcg gtg cac ggc cag gtg gec aag gcg ate gec gag gac 124 8 
Arg lie Asp Ala Val His Gly Gin Val Ala Lys Ala lie Ala Glu Asp 
405 410 415 

ctg ggc gcg ggc ggc gtc gac gtc cgg ctg gtc gag atc gac cag acc 1296
Leu Gly Ala Gly Gly Val Asp Val Arg Leu Val Glu Ile Asp Gln Thr
420 425 430 

gac gag tac tac cgc ate etc cag gac ccg gee cgc gcg gcg gcg ggg 1344 
Asp Glu Tyr Tyr Arg lie Leu Gin Asp Pro Ala Arg Ala Ala Ala Gly 
435 440 445 

gag tgg gac ate acg ccg gec gee tgg atg ccg gac tgg ttc ggc aac 13 92 
Glu Trp Asp He Thr Pro Ala Ala Trp Met Pro Asp Trp Phe Gly Asn 
450 455 460 

aac ggg egg teg tac gtc cag ccg atg ttc cag tec aac acc ggc gtc 144 0 
Asn Gly Arg Ser Tyr Val Gin Pro Met Phe Gin Ser Asn Thr Gly Val 
465 470 475 480 

ggc acg gec aac tac ggc ggc tac cac aac ccg etc gtc gac gag ctg 14 88 
Gly Thr Ala Asn Tyr Gly Gly Tyr His Asn Pro Leu Val Asp Glu Leu 
485 490 495 
atc gac cgc gcg ttg tcc gcc cgg acg gag gcc gag gcg gag gag ctg 1536
Ile Asp Arg Ala Leu Ser Ala Arg Thr Glu Ala Glu Ala Glu Glu Leu
500 505 510

tgg cac cgg gtc gac cgg cag gtg ctg cag gac gtg gcg atc gtg ccg 1584
Trp His Arg Val Asp Arg Gln Val Leu Gln Asp Val Ala Ile Val Pro
515 520 525

atc ctg gcc tgc gag ccg acc atc gag cac ctg acc agt tcc cgg gtg 1632
Ile Leu Ala Cys Glu Pro Thr Ile Glu His Leu Thr Ser Ser Arg Val
530 535 540

cgg ggg gcg atc ccg ctg ccg cac gtg gac cgc tgg tac gac gcg gcg 1680
Arg Gly Ala Ile Pro Leu Pro His Val Asp Arg Trp Tyr Asp Ala Ala
545 550 555 560

aac ctc tgg ctg gac ccg ccc gac tga 1707
Asn Leu Trp Leu Asp Pro Pro Asp *
565



<210> 16 
<211> 568 
<212> PRT 
<213> Bacteria 



<400> 16 
Val Pro Asp His Asp Gln Gln Pro Arg His Gly Gly Thr Leu Arg Tyr
1 5 10 15

Tyr Gly Pro Gly Gly Leu Asp His Leu Asp Pro Ala Ala Ala Tyr Tyr
20 25 30

Ala Phe Ser His Gln Val Ile Arg Leu Phe Ala Arg Gln Leu Phe Ser
35 40 45

Tyr Pro Thr Thr Glu Asp Ala Ala Ala Leu Val Pro Val Pro Asp Val
50 55 60

Ala Ala Glu Leu Pro Thr Val Asp Asn Gly Gly Leu Ser Glu Asp Gly
65 70 75 80

Arg Thr Tyr Thr Ile Arg Leu Arg Asp Gly Val Arg Trp Asp Thr Ala
85 90 95

Pro Pro Arg Pro Val Thr Ala Gly Asp Phe Val Arg Gly Phe Lys Arg
100 105 110

Met Ala Asn Pro Val Ala Gly Ala Gly Ala Ile Ala Tyr Tyr Thr Ser
115 120 125

Thr Ile Ala Gly Met Ala Glu Phe Ala Glu Gly Tyr Arg Ala Arg Phe
130 135 140

Ala Gly Arg Thr Pro Thr Ala Ala Glu Leu Ala Asp Tyr Gln Asn Gly
145 150 155 160

His Glu Ile Ser Gly Leu Trp Ala Lys Asp Asp Arg Thr Leu Val Ile
165 170 175

Glu Leu Leu Arg Pro Ala Asn Asp Met Leu Asn Leu Leu Ala Met Pro
180 185 190

Phe Ala Ser Ala Ala Pro Arg Glu Phe Asp Asp Leu Val Pro Asp Gly
195 200 205

Pro Asp Phe Ala Arg Leu Val Arg Ser Asn Gly Pro Tyr Arg Ile Thr
210 215 220

Gly Tyr Ala Arg Gly Ser His Leu Thr Met Asp His Asn Pro Ala Trp
225 230 235 240

Arg Ala Asp Ala Asp Pro Ile Arg Arg Arg Tyr Val Asp Arg Ile Glu
245 250 255

Val Arg Met Ala Arg Val Ser Asp Glu Arg Val Arg Ala Glu Ile Glu
260 265 270

Ser Gly Ala Ala Asp Leu Ser Trp Gly Ala Ala Val Gly Arg Pro Arg
275 280 285

Arg Arg Thr Ala Ala Asp Arg Asn Leu Gly Trp Ala Leu Asn Pro Tyr
290 295 300

Leu Ala Phe Asn Leu His Ser Pro His Glu Arg Gly Ala Leu Arg Asp
305 310 315 320

Arg Thr Val Arg Leu Ala Ile Ala Tyr Ala Val Asp Lys Ala Arg Leu
325 330 335

Val Arg Phe Phe Asp Asp Met Asn Ile Gly Thr Val Thr Arg Pro Ala
340 345 350

His Thr Ala Ile Pro Pro Gly Asn Phe Gly His Arg Glu Tyr Asp Pro
355 360 365

Tyr Pro Thr Pro Gly Asp Arg Gly Asp Arg Ala Arg Cys Arg Glu Leu
370 375 380

Leu Ala Glu Ala Gly Tyr Pro Asp Gly Leu Arg Leu Thr Met Ile Tyr
385 390 395 400

Arg Ile Asp Ala Val His Gly Gln Val Ala Lys Ala Ile Ala Glu Asp
405 410 415

Leu Gly Ala Gly Gly Val Asp Val Arg Leu Val Glu Ile Asp Gln Thr
420 425 430

Asp Glu Tyr Tyr Arg Ile Leu Gln Asp Pro Ala Arg Ala Ala Ala Gly
435 440 445

Glu Trp Asp Ile Thr Pro Ala Ala Trp Met Pro Asp Trp Phe Gly Asn
450 455 460

Asn Gly Arg Ser Tyr Val Gln Pro Met Phe Gln Ser Asn Thr Gly Val
465 470 475 480

Gly Thr Ala Asn Tyr Gly Gly Tyr His Asn Pro Leu Val Asp Glu Leu
485 490 495

Ile Asp Arg Ala Leu Ser Ala Arg Thr Glu Ala Glu Ala Glu Glu Leu
500 505 510

Trp His Arg Val Asp Arg Gln Val Leu Gln Asp Val Ala Ile Val Pro
515 520 525

Ile Leu Ala Cys Glu Pro Thr Ile Glu His Leu Thr Ser Ser Arg Val
530 535 540

Arg Gly Ala Ile Pro Leu Pro His Val Asp Arg Trp Tyr Asp Ala Ala
545 550 555 560

Asn Leu Trp Leu Asp Pro Pro Asp
565



<210> 17 
<211> 999 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1) . . . (999) 

<400> 17 

atg gac agg ttg cag tcg gcg ctg gcc ctc tac gag gag gcg atg ggc 48
Met Asp Arg Leu Gln Ser Ala Leu Ala Leu Tyr Glu Glu Ala Met Gly
1 5 10 15

tac acg tac gcg gca gcc ctg cgg gcc gcc gcc gcc gtc ggc gtc gcc 96
Tyr Thr Tyr Ala Ala Ala Leu Arg Ala Ala Ala Ala Val Gly Val Ala
20 25 30



gac cac ctg gtc gac ggc ccc cgt acg ccc gcc gag ctg gcc gcc gcg 144 
Asp His Leu Val Asp Gly Pro Arg Thr Pro Ala Glu Leu Ala Ala Ala 
35 40 45 



acg ggc acc gac gcg gac gcg ctc cgc cgg gtg ctg cgc ctg ctg gcg 192
Thr Gly Thr Asp Ala Asp Ala Leu Arg Arg Val Leu Arg Leu Leu Ala 
50 55 60 



gtc cgc gac gtg gtc cgc gag tec gac ggc egg ttc gcg ctg acc gac 240 
Val Arg Asp Val Val Arg Glu Ser Asp Gly Arg Phe Ala Leu Thr Asp 
65 70 75 80 



aag ggc gcg gcg ctg cgg tcg gac tcg ccg gtg ccc gcg cgg gcc ggc 288
Lys Gly Ala Ala Leu Arg Ser Asp Ser Pro Val Pro Ala Arg Ala Gly 
85 90 95 



ate etc atg ttc acc gac acg atg ttc tgg acg atg agt cac egg gtg 336 
lie Leu Met Phe Thr Asp Thr Met Phe Trp Thr Met Ser His Arg Val 
100 105 110 



gcg age gcg ctg ggg ccg gag cga ccc gcc ttc gcc gac ate ttc ggt 3 84 
Ala Ser Ala Leu Gly Pro Glu Arg Pro Ala Phe Ala Asp lie Phe Gly 
115 120 125 



age teg ctg gac gcc tac ttc gac ggc gac gcc gag gtc gag gcg etc 432 
Ser Ser Leu Asp Ala Tyr Phe Asp Gly Asp Ala Glu Val Glu Ala Leu 
130 135 140 



tac tac gag ggc atg gaa acg gtc age gcg gcg gag cac etc att etc 480 
Tyr Tyr Glu Gly Met Glu Thr Val Ser Ala Ala Glu His Leu lie Leu 
145 150 155 160 



gcc cgc gcc ggt gac ttc ccc gcc acc ggc acc gtg gcg gac gtc ggc 528 
Ala Arg Ala Gly Asp Phe Pro Ala Thr Gly Thr Val Ala Asp Val Gly 
165 170 175 



ggc ggc cgg ggc ggc ttc ctg ctc acc gtc cta cgc gag cac ccc ggc 576
Gly Gly Arg Gly Gly Phe Leu Leu Thr Val Leu Arg Glu His Pro Gly 
180 185 190 



ctg cag ggc gtg ctg ctg gac cgc gcg gag gtg gtc gcc egg cac egg 624 
Leu Gin Gly Val Leu Leu Asp Arg Ala Glu Val Val Ala Arg His Arg 
195 200 205 



ctg gac gcc ccg gac gtg gcg ggg cgc tgg aag gtt gtc gag ggc gac 672 
Leu Asp Ala Pro Asp Val Ala Gly Arg Trp Lys Val Val Glu Gly Asp 
210 215 220 



ttc ctc cgc gag gtg ccc cac gcc gac gtg cac gtg ctc aag cgc ate 72 0 
Phe Leu Arg Glu Val Pro His Ala Asp Val His Val Leu Lys Arg lie 
225 230 235 240 



ctg cac aac tgg ggc gac gag gac age gtc egg ate ctg acg aac tgc 768 
Leu His Asn Trp Gly Asp Glu Asp Ser Val Arg lie Leu Thr Asn Cys 
245 250 255 
cgc cgg gtc atg ccc gcg cac ggc cgg gtg ctc gtg atc gac gcg gtc 816
Arg Arg Val Met Pro Ala His Gly Arg Val Leu Val Ile Asp Ala Val
260 265 270 



gtc ccc gag ggc aac gac gcg cac cag age aag gag atg gac ttc atg 864 
Val Pro Glu Gly Asn Asp Ala His Gin Ser Lys Glu Met Asp Phe Met 
275 280 285 

atg etc gee gcg cgc acc ggc cag gaa cgc ace gee gec gag ctg gag 912 
Met Leu Ala Ala Arg Thr Gly Gin Glu Arg Thr Ala Ala Glu Leu Glu 
290 295 300 

ccg ttg ttc acc gcg gee ggg ctg cgc ctg gac egg gtc gtc ggc acc 96 0 
Pro Leu Phe Thr Ala Ala Gly Leu Arg Leu Asp Arg Val Val Gly Thr 
305 310 315 320 

teg teg gtc atg tec ate gcg gtc ggc gtg ccg gec tga 999 
Ser Ser Val Met Ser lie Ala Val Gly Val Pro Ala * 
325 330 



<210> 18 
<211> 332 
<212> PRT 
<213> Bacteria 



<400> 18 
Met Asp Arg Leu Gln Ser Ala Leu Ala Leu Tyr Glu Glu Ala Met Gly
1 5 10 15

Tyr Thr Tyr Ala Ala Ala Leu Arg Ala Ala Ala Ala Val Gly Val Ala
20 25 30

Asp His Leu Val Asp Gly Pro Arg Thr Pro Ala Glu Leu Ala Ala Ala
35 40 45

Thr Gly Thr Asp Ala Asp Ala Leu Arg Arg Val Leu Arg Leu Leu Ala
50 55 60

Val Arg Asp Val Val Arg Glu Ser Asp Gly Arg Phe Ala Leu Thr Asp
65 70 75 80

Lys Gly Ala Ala Leu Arg Ser Asp Ser Pro Val Pro Ala Arg Ala Gly
85 90 95

Ile Leu Met Phe Thr Asp Thr Met Phe Trp Thr Met Ser His Arg Val
100 105 110

Ala Ser Ala Leu Gly Pro Glu Arg Pro Ala Phe Ala Asp Ile Phe Gly
115 120 125

Ser Ser Leu Asp Ala Tyr Phe Asp Gly Asp Ala Glu Val Glu Ala Leu
130 135 140

Tyr Tyr Glu Gly Met Glu Thr Val Ser Ala Ala Glu His Leu Ile Leu
145 150 155 160

Ala Arg Ala Gly Asp Phe Pro Ala Thr Gly Thr Val Ala Asp Val Gly
165 170 175

Gly Gly Arg Gly Gly Phe Leu Leu Thr Val Leu Arg Glu His Pro Gly
180 185 190

Leu Gln Gly Val Leu Leu Asp Arg Ala Glu Val Val Ala Arg His Arg
195 200 205

Leu Asp Ala Pro Asp Val Ala Gly Arg Trp Lys Val Val Glu Gly Asp
210 215 220

Phe Leu Arg Glu Val Pro His Ala Asp Val His Val Leu Lys Arg Ile
225 230 235 240

Leu His Asn Trp Gly Asp Glu Asp Ser Val Arg Ile Leu Thr Asn Cys
245 250 255

Arg Arg Val Met Pro Ala His Gly Arg Val Leu Val Ile Asp Ala Val
260 265 270

Val Pro Glu Gly Asn Asp Ala His Gln Ser Lys Glu Met Asp Phe Met
275 280 285

Met Leu Ala Ala Arg Thr Gly Gln Glu Arg Thr Ala Ala Glu Leu Glu
290 295 300

Pro Leu Phe Thr Ala Ala Gly Leu Arg Leu Asp Arg Val Val Gly Thr
305 310 315 320

Ser Ser Val Met Ser Ile Ala Val Gly Val Pro Ala
325 330



<210> 19 
<211> 1323 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1) . . . (1323) 

<400> 19 

gtg age cgt acc gtg gag tec cct ggc ccg gec acc gtg teg gcg tea 48 
Val Ser Arg Thr Val Glu Ser Pro Gly Pro Ala Thr Val Ser Ala Ser 
1 5 10 15

ccg gcg cag agt ccg ctg cgc acc gcg tec tgg gec cgc ate cgc gag 96 
Pro Ala Gin Ser Pro Leu Arg Thr Ala Ser Trp Ala Arg He Arg Glu 
20 25 30 

ctg ttc gec ctg gac ccg acg acc gtg cac etc aac acg ggg acg gtc 144 
Leu Phe Ala Leu Asp Pro Thr Thr Val His Leu Asn Thr Gly Thr Val 
35 40 45 

ggc gec atg ccg tac gag gtg ctg gac acc gtg gac egg gtg acc cgc 192 
Gly Ala Met Pro Tyr Glu Val Leu Asp Thr Val Asp Arg Val Thr Arg 
50 55 60 

cag tgg acc ggc ggc ctg etc gac gtc tac cgc ccg gcg atg ttc acc 240 
Gin Trp Thr Gly Gly Leu Leu Asp Val Tyr Arg Pro Ala Met Phe Thr 
65 70 75 80 

gag tac egg gac gec ate gcg aag acg ttc ggc gtg gac ggc gac gag 288 
Glu Tyr Arg Asp Ala He Ala Lys Thr Phe Gly Val Asp Gly Asp Glu 
85 90 95 

ate gtg ate tgc cac aac gee acc gag ggg gtc gee egg gtc ate cac 336 
He Val He Cys His Asn Ala Thr Glu Gly Val Ala Arg Val He His 
100 105 110 

ggc etc gac ctg cgc gag ggc gac gag gtg gtg acg acc acg cac gag 384 
Gly Leu Asp Leu Arg Glu Gly Asp Glu Val Val Thr Thr Thr His Glu 
115 120 125 

tgc tac tec gtg ctg tec aac ttc aac ctg ctg cgc aac egg ttc ggg 432 
Cys Tyr Ser Val Leu Ser Asn Phe Asn Leu Leu Arg Asn Arg Phe Gly 
130 135 140

gtg gtg ctg aag acc gtc acc ccg ccg tcc ggc cac gag gtg cgc gcg 480
Val Val Leu Lys Thr Val Thr Pro Pro Ser Gly His Glu Val Arg Ala
145 150 155 160

gag gag atc gtc gag ctg gtc gag gcc gcc atc acg ccc cgg acg aag 528
Glu Glu Ile Val Glu Leu Val Glu Ala Ala Ile Thr Pro Arg Thr Lys
165 170 175

gtg ctc tcg ttc gcc gcg atc acc ctc ttc acc ggg acg atg ttc ccc 576
Val Leu Ser Phe Ala Ala Ile Thr Leu Phe Thr Gly Thr Met Phe Pro
180 185 190

atc cgg cag ctc tgc gag ctg gcg cac cgg cac ggg ctg acc acc gtc 624
Ile Arg Gln Leu Cys Glu Leu Ala His Arg His Gly Leu Thr Thr Val
195 200 205

atc gac ggc gcg ctg atc ccc ggc atg ctc gac tgc gac ctg cgc gcg 672
Ile Asp Gly Ala Leu Ile Pro Gly Met Leu Asp Cys Asp Leu Arg Ala
210 215 220

acc ggg gcg gac ttc atc acc tgc tcc ggg tcg aag ttc cag tgc ggc 720
Thr Gly Ala Asp Phe Ile Thr Cys Ser Gly Ser Lys Phe Gln Cys Gly
225 230 235 240

ccg ctc ggc acc ggc ctg atc tac gtc cgc aac aag gtc gtc ccc gag 768
Pro Leu Gly Thr Gly Leu Ile Tyr Val Arg Asn Lys Val Val Pro Glu
245 250 255

cac aac ccc ctg ccg ctg ccc acg ttc tgg ccg ctc atc tcg acc tgg 816
His Asn Pro Leu Pro Leu Pro Thr Phe Trp Pro Leu Ile Ser Thr Trp
260 265 270

tac ccg atg atg ggc agc ccg ccg ccg cgg acc agc acc gcc gtg gag 864
Tyr Pro Met Met Gly Ser Pro Pro Pro Arg Thr Ser Thr Ala Val Glu
275 280 285

agc tac aac atg ggc gac ttc ctg cag agc gcc ggc agc gcc aac ctg 912
Ser Tyr Asn Met Gly Asp Phe Leu Gln Ser Ala Gly Ser Ala Asn Leu
290 295 300

gcg cgg ggc gcc gcc ctg gcc cgg gcc ttc gag ctg tgg gac gac atc 960
Ala Arg Gly Ala Ala Leu Ala Arg Ala Phe Glu Leu Trp Asp Asp Ile
305 310 315 320

ggc cgc gac cgc atc gag gcg tac atc atg gac ctc gcc gag tac gcc 1008
Gly Arg Asp Arg Ile Glu Ala Tyr Ile Met Asp Leu Ala Glu Tyr Ala
325 330 335

cgc ggc cgg ctc atc gac gcg ttc ggc gtc gag gcc atg tac tcc ccc 1056
Arg Gly Arg Leu Ile Asp Ala Phe Gly Val Glu Ala Met Tyr Ser Pro
340 345 350

ggc gcc gac ccg cgg ctg cgc tcg ccg ctg ctc gcg ttc aac ccg ttc 1104
Gly Ala Asp Pro Arg Leu Arg Ser Pro Leu Leu Ala Phe Asn Pro Phe
355 360 365



cgg cgg ccg gag gac gcc tgg aac atc aag aag ttc atc ggc ttc gtc 1152

Arg Arg Pro Glu Asp Ala Trp Asn Ile Lys Lys Phe Ile Gly Phe Val

370 375 380 

aag cgc ctg gag acc gag cac egg ate tgg acc cgc tgg acg gag ttc 12 00 

Lys Arg Leu Glu Thr Glu His Arg He Trp Thr Arg Trp Thr Glu Phe 
385 390 395 400 

gac gtg ccc ggc tec ccg cac cag cac tac gcg gcg cgc ate acc acg 1248 

Asp Val Pro Gly Ser Pro His Gin His Tyr Ala Ala Arg He Thr Thr 
405 410 415 

cac ctg ttc aac acc egg gaa gag ate gac cac acc gta egg acg atg 12 96 

His Leu Phe Asn Thr Arg Glu Glu He Asp His Thr Val Arg Thr Met 
420 425 430 

gtc cgc ctg gec gag gag atg tct tga 1323 

Val Arg Leu Ala Glu Glu Met Ser * 

435 440 



<210> 20 
<211> 440 
<212> PRT 
<213> Bacteria 

<400> 20 

Val Ser Arg Thr Val Glu Ser Pro Gly Pro Ala Thr Val Ser Ala Ser
1 5 10 15

Pro Ala Gln Ser Pro Leu Arg Thr Ala Ser Trp Ala Arg Ile Arg Glu
20 25 30

Leu Phe Ala Leu Asp Pro Thr Thr Val His Leu Asn Thr Gly Thr Val
35 40 45

Gly Ala Met Pro Tyr Glu Val Leu Asp Thr Val Asp Arg Val Thr Arg
50 55 60

Gln Trp Thr Gly Gly Leu Leu Asp Val Tyr Arg Pro Ala Met Phe Thr
65 70 75 80

Glu Tyr Arg Asp Ala Ile Ala Lys Thr Phe Gly Val Asp Gly Asp Glu
85 90 95

Ile Val Ile Cys His Asn Ala Thr Glu Gly Val Ala Arg Val Ile His
100 105 110

Gly Leu Asp Leu Arg Glu Gly Asp Glu Val Val Thr Thr Thr His Glu
115 120 125

Cys Tyr Ser Val Leu Ser Asn Phe Asn Leu Leu Arg Asn Arg Phe Gly
130 135 140

Val Val Leu Lys Thr Val Thr Pro Pro Ser Gly His Glu Val Arg Ala
145 150 155 160

Glu Glu Ile Val Glu Leu Val Glu Ala Ala Ile Thr Pro Arg Thr Lys
165 170 175

Val Leu Ser Phe Ala Ala Ile Thr Leu Phe Thr Gly Thr Met Phe Pro
180 185 190

Ile Arg Gln Leu Cys Glu Leu Ala His Arg His Gly Leu Thr Thr Val
195 200 205

Ile Asp Gly Ala Leu Ile Pro Gly Met Leu Asp Cys Asp Leu Arg Ala
210 215 220

Thr Gly Ala Asp Phe Ile Thr Cys Ser Gly Ser Lys Phe Gln Cys Gly
225 230 235 240

Pro Leu Gly Thr Gly Leu Ile Tyr Val Arg Asn Lys Val Val Pro Glu
245 250 255

His Asn Pro Leu Pro Leu Pro Thr Phe Trp Pro Leu Ile Ser Thr Trp
260 265 270

Tyr Pro Met Met Gly Ser Pro Pro Pro Arg Thr Ser Thr Ala Val Glu
275 280 285

Ser Tyr Asn Met Gly Asp Phe Leu Gln Ser Ala Gly Ser Ala Asn Leu
290 295 300

Ala Arg Gly Ala Ala Leu Ala Arg Ala Phe Glu Leu Trp Asp Asp Ile
305 310 315 320

Gly Arg Asp Arg Ile Glu Ala Tyr Ile Met Asp Leu Ala Glu Tyr Ala
325 330 335

Arg Gly Arg Leu Ile Asp Ala Phe Gly Val Glu Ala Met Tyr Ser Pro
340 345 350

Gly Ala Asp Pro Arg Leu Arg Ser Pro Leu Leu Ala Phe Asn Pro Phe
355 360 365

Arg Arg Pro Glu Asp Ala Trp Asn Ile Lys Lys Phe Ile Gly Phe Val
370 375 380

Lys Arg Leu Glu Thr Glu His Arg Ile Trp Thr Arg Trp Thr Glu Phe
385 390 395 400

Asp Val Pro Gly Ser Pro His Gln His Tyr Ala Ala Arg Ile Thr Thr
405 410 415

His Leu Phe Asn Thr Arg Glu Glu Ile Asp His Thr Val Arg Thr Met
420 425 430

Val Arg Leu Ala Glu Glu Met Ser
435 440



<210> 21 
<211> 1683 
<212> DNA 
<213> Bacteria 

<220> 
<221> CDS 
<222> (1) . . . (1683) 

<400> 21 

gtg acg cag gcg cgc agt gca acg acg acg aac gac acc cgg ctg cgg 48
Val Thr Gln Ala Arg Ser Ala Thr Thr Thr Asn Asp Thr Arg Leu Arg
1 5 10 15

ggc acc ctg cgg ctg ctc ggg ccc gcc gcc gtc cac cag gcg gac ccg 96
Gly Thr Leu Arg Leu Leu Gly Pro Ala Ala Val His Gln Ala Asp Pro
20 25 30

gcc gcc gcc tgg tcg ccg gcc gag cgc cag ttg ctg cgg ctg tgc acc 144
Ala Ala Ala Trp Ser Pro Ala Glu Arg Gln Leu Leu Arg Leu Cys Thr
35 40 45

cgg cag ctg atc agc tac cgc ccg gaa ccc gac ccg ggc gac tgg cgc 192
Arg Gln Leu Ile Ser Tyr Arg Pro Glu Pro Asp Pro Gly Asp Trp Arg
50 55 60

gcc ctc gcg ccg gtc gcc gac ctg gcc acc gac gtc ccc tcg acc tac 240
Ala Leu Ala Pro Val Ala Asp Leu Ala Thr Asp Val Pro Ser Thr Tyr
65 70 75 80

aac gcc ggc ctg ggc gcc agc cac cgc agc tac gtg gtg cac ctg cgc 288
Asn Ala Gly Leu Gly Ala Ser His Arg Ser Tyr Val Val His Leu Arg
85 90 95

ccc ggg gtg etc tgg gac acg ccg acc ccc cgc ccg gtg acg gcg cac 336 

Pro Gly Val Leu Trp Asp Thr Pro Thr Pro Arg Pro Val Thr Ala His 

100 105 110 

gac gtc gta cgc ggc ttc aag egg ctg gec aac ccg etc acc cga cac 3 84 
Asp Val Val Arg Gly Phe Lys Arg Leu Ala Asn Pro Leu Thr Arg His 
115 120 125 

ccc gcg ctg gcg tac ttc egg ggc acc ctg egg ggc atg ggc egg tac 43 2 

Pro Ala Leu Ala Tyr Phe Arg Gly Thr Leu Arg Gly Met Gly Arg Tyr 
130 135 140 

tgc gac gag tac gcg gcg gcg gtc gec ggc cac ccg gtc acc gcg gcg 48 0 

Cys Asp Glu Tyr Ala Ala Ala Val Ala Gly His Pro Val Thr Ala Ala 
145 150 155 160 

ctg etc gec ggc ttc cag gac gee cac gag ate ccc ggc gtg ttc gec 52 8 
Leu Leu Ala Gly Phe Gin Asp Ala His Glu lie Pro Gly Val Phe Ala 
165 170 175 

gtc gac gac gag acg gtg gtc ttc gag ctg gac cgt ccg gcg ctg gac 5 76 

Val Asp Asp Glu Thr Val Val Phe Glu Leu Asp Arg Pro Ala Leu Asp 

180 185 190 

ttc gtc gac atg ctg gcg cag age ggc gee tec ccg gee ccg gtg gag 624 

Phe Val Asp Met Leu Ala Gin Ser Gly Ala Ser Pro Ala Pro Val Glu 
195 200 205 

tac gac gca cac ctg ccg gga age gee ggc ctg cac gag cac ctg gtc 6 72 

Tyr Asp Ala His Leu Pro Gly Ser Ala Gly Leu His Glu His Leu Val 
210 215 220 

gec aac ggc ccg tac cgc gtc gtg teg tgg cgc ccc ggg ggc acc ate 72 0 

Ala Asn Gly Pro Tyr Arg Val Val Ser Trp Arg Pro Gly Gly Thr lie 
225 230 235 240 

egg ctg gag ccg aac ccg gcg tgg egg gcg gag acc gac ccg ate cgc 768 

Arg Leu Glu Pro Asn Pro Ala Trp Arg Ala Glu Thr Asp Pro lie Arg 
245 250 255 

gag egg egg ttc gac gec gtc gag ttc cgc gtc gee atg ggc ggg ccg 816 

Glu Arg Arg Phe Asp Ala Val Glu Phe Arg Val Ala Met Gly Gly Pro 

260 265 270 

cgc gaa ctg gee gac egg etc gec gec gac gac gec gac ctg ccg tgg 864 

Arg Glu Leu Ala Asp Arg Leu Ala Ala Asp Asp Ala Asp Leu Pro Trp 
275 280 285 

ggc gtg ccg ate ggc ccg gtg ccc ggt cag egg etc gac ccg tgc ctg 912 

Gly Val Pro lie Gly Pro Val Pro Gly Gin Arg Leu Asp Pro Cys Leu 
290 295 300 

gtg ttc aac ctg cgc gac ccc gec aac ccg gec gtc gec gac gec gcg 960 

Val Phe Asn Leu Arg Asp Pro Ala Asn Pro Ala Val Ala Asp Ala Ala 
305 310 315 320 
gtg cgc cgg gtc gtc gcc ggg gcg gtc gac cgg gcg gcg ctg gtg cgc 1008
Val Arg Arg Val Val Ala Gly Ala Val Asp Arg Ala Ala Leu Val Arg 
325 330 335 

ate gec egg gec gec gac ccg tgg tec gag gtc cgc gec gcg cac ace 1056 
lie Ala Arg Ala Ala Asp Pro Trp Ser Glu Val Arg Ala Ala His Thr 
340 345 350 

gtc gtg ccg ccc ggc aac gac ggg cac egg cag ccc gac ccg etc acc 1104 
Val Val Pro Pro Gly Asn Asp Gly His Arg Gin Pro Asp Pro Leu Thr 
355 360 365 

gac ccg att ccc gac gcc gac gcg gac ccg cgc gag cgg ctc gcc gcc 1152
Asp Pro Ile Pro Asp Ala Asp Ala Asp Pro Arg Glu Arg Leu Ala Ala
370 375 380 

gcg ggg cac ccg gac ggg ctc acc ctg acc gcg gtg cac ccc gac acg 1200
Ala Gly His Pro Asp Gly Leu Thr Leu Thr Ala Val His Pro Asp Thr 
385 390 395 400 

gec gag gac ctg gcg ctg gec cgc teg tgg gcg gec gac etc ggc gee 124 8 
Ala Glu Asp Leu Ala Leu Ala Arg Ser Trp Ala Ala Asp Leu Gly Ala 
405 410 415 

gec ggc ate gac gta cgc ctg gtc gcg etc gac gac gec aac cac egg 1296 
Ala Gly lie Asp Val Arg Leu Val Ala Leu Asp Asp Ala Asn His Arg 
420 425 430 

gec ctg etc gee gee acg ggc gac gcg ccc ggc ctg cga tgg gac ctg 1344 
Ala Leu Leu Ala Ala Thr Gly Asp Ala Pro Gly Leu Arg Trp Asp Leu 

435 440 445 

gcg acc gee acg ttc acc gcg ccg tgg gee tac ggc aac gee egg gtg 1392 
Ala Thr Ala Thr Phe Thr Ala Pro Trp Ala Tyr Gly Asn Ala Arg Val 
450 455 460 

ttc ctg caa ccg ctg gtc ggc gag gga ccc ggc aac ccc ggc ggc tac 144 0 
Phe Leu Gin Pro Leu Val Gly Glu Gly Pro Gly Asn Pro Gly Gly Tyr 
465 470 475 480 

cgc gac ccc ggg gtt gac egg gtg gtc gag cgc gcg ctg gac gcg gec 14 88 
Arg Asp Pro Gly Val Asp Arg Val Val Glu Arg Ala Leu Asp Ala Ala 
485 490 495 

gac ccg cgc gag gcg gtc gee ctg tgg cag gag gtg gag egg egg ctg 153 6 
Asp Pro Arg Glu Ala Val Ala Leu Trp Gin Glu Val Glu Arg Arg Leu 
500 505 510 

etc gee gac gec gcg gtc gta ccc ctg ctg ttc egg egg gee acg gac 1584 
Leu Ala Asp Ala Ala Val Val Pro Leu Leu Phe Arg Arg Ala Thr Asp 
515 520 525 

gee gcg ccg cgc ggg ccc egg gtg egg cgc gcg acc gee ctg ccg gcg 1632 
Ala Ala Pro Arg Gly Pro Arg Val Arg Arg Ala Thr Ala Leu Pro Ala 
530 535 540 

ctc gcc ggc ctg ccc gac ctc gcc gac gtg cgg ctc ggg gtg gac cgg 1680
Leu Ala Gly Leu Pro Asp Leu Ala Asp Val Arg Leu Gly Val Asp Arg
545 550 555 560

tga 1683



<210> 22 
<211> 560 
<212> PRT 
<213> Bacteria 

<400> 22 

Val Thr Gln Ala Arg Ser Ala Thr Thr Thr Asn Asp Thr Arg Leu Arg
1 5 10 15

Gly Thr Leu Arg Leu Leu Gly Pro Ala Ala Val His Gln Ala Asp Pro
20 25 30

Ala Ala Ala Trp Ser Pro Ala Glu Arg Gln Leu Leu Arg Leu Cys Thr
35 40 45

Arg Gln Leu Ile Ser Tyr Arg Pro Glu Pro Asp Pro Gly Asp Trp Arg
50 55 60

Ala Leu Ala Pro Val Ala Asp Leu Ala Thr Asp Val Pro Ser Thr Tyr
65 70 75 80

Asn Ala Gly Leu Gly Ala Ser His Arg Ser Tyr Val Val His Leu Arg
85 90 95

Pro Gly Val Leu Trp Asp Thr Pro Thr Pro Arg Pro Val Thr Ala His
100 105 110

Asp Val Val Arg Gly Phe Lys Arg Leu Ala Asn Pro Leu Thr Arg His
115 120 125

Pro Ala Leu Ala Tyr Phe Arg Gly Thr Leu Arg Gly Met Gly Arg Tyr
130 135 140

Cys Asp Glu Tyr Ala Ala Ala Val Ala Gly His Pro Val Thr Ala Ala
145 150 155 160

Leu Leu Ala Gly Phe Gln Asp Ala His Glu Ile Pro Gly Val Phe Ala
165 170 175

Val Asp Asp Glu Thr Val Val Phe Glu Leu Asp Arg Pro Ala Leu Asp
180 185 190

Phe Val Asp Met Leu Ala Gln Ser Gly Ala Ser Pro Ala Pro Val Glu
195 200 205

Tyr Asp Ala His Leu Pro Gly Ser Ala Gly Leu His Glu His Leu Val
210 215 220

Ala Asn Gly Pro Tyr Arg Val Val Ser Trp Arg Pro Gly Gly Thr Ile
225 230 235 240

Arg Leu Glu Pro Asn Pro Ala Trp Arg Ala Glu Thr Asp Pro Ile Arg
245 250 255

Glu Arg Arg Phe Asp Ala Val Glu Phe Arg Val Ala Met Gly Gly Pro
260 265 270

Arg Glu Leu Ala Asp Arg Leu Ala Ala Asp Asp Ala Asp Leu Pro Trp
275 280 285

Gly Val Pro Ile Gly Pro Val Pro Gly Gln Arg Leu Asp Pro Cys Leu
290 295 300

Val Phe Asn Leu Arg Asp Pro Ala Asn Pro Ala Val Ala Asp Ala Ala
305 310 315 320

Val Arg Arg Val Val Ala Gly Ala Val Asp Arg Ala Ala Leu Val Arg
325 330 335

Ile Ala Arg Ala Ala Asp Pro Trp Ser Glu Val Arg Ala Ala His Thr
340 345 350

Val Val Pro Pro Gly Asn Asp Gly His Arg Gln Pro Asp Pro Leu Thr
355 360 365

Asp Pro Ile Pro Asp Ala Asp Ala Asp Pro Arg Glu Arg Leu Ala Ala
370 375 380

Ala Gly His Pro Asp Gly Leu Thr Leu Thr Ala Val His Pro Asp Thr
385 390 395 400

Ala Glu Asp Leu Ala Leu Ala Arg Ser Trp Ala Ala Asp Leu Gly Ala
405 410 415

Ala Gly Ile Asp Val Arg Leu Val Ala Leu Asp Asp Ala Asn His Arg
420 425 430

Ala Leu Leu Ala Ala Thr Gly Asp Ala Pro Gly Leu Arg Trp Asp Leu
435 440 445

Ala Thr Ala Thr Phe Thr Ala Pro Trp Ala Tyr Gly Asn Ala Arg Val
450 455 460

Phe Leu Gln Pro Leu Val Gly Glu Gly Pro Gly Asn Pro Gly Gly Tyr
465 470 475 480

Arg Asp Pro Gly Val Asp Arg Val Val Glu Arg Ala Leu Asp Ala Ala
485 490 495

Asp Pro Arg Glu Ala Val Ala Leu Trp Gln Glu Val Glu Arg Arg Leu
500 505 510

Leu Ala Asp Ala Ala Val Val Pro Leu Leu Phe Arg Arg Ala Thr Asp
515 520 525

Ala Ala Pro Arg Gly Pro Arg Val Arg Arg Ala Thr Ala Leu Pro Ala
530 535 540

Leu Ala Gly Leu Pro Asp Leu Ala Asp Val Arg Leu Gly Val Asp Arg
545 550 555 560



<210> 23 
<211> 1248 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1) . . . (1248)

<400> 23 

gtg acc ggt gcc gcc gcc gac gcg gtg gtc gcc gac tac etc gcg etc 48 
Val Thr Gly Ala Ala Ala Asp Ala Val Val Ala Asp Tyr Leu Ala Leu 
1 5 10 15



ggg ctg egg atg ggt egg etc gtc gag ggc tac gtc gac tgc tgg ttc 96 
Gly Leu Arg Met Gly Arg Leu Val Glu Gly Tyr Val Asp Cys Trp Phe 
20 25 30 

ggc gac egg gcc etc gcc gag egg gtc gcc gcg gag ccg gcg ccg gac 144 
Gly Asp Arg Ala Leu Ala Glu Arg Val Ala Ala Glu Pro Ala Pro Asp 
35 40 45 

ccg gcg gag ctg gcc gga cag gcc cgc gac ctg ctg cgc cgc ctg ggc 192
Pro Ala Glu Leu Ala Gly Gln Ala Arg Asp Leu Leu Arg Arg Leu Gly
50 55 60 

gac gcg gac etc gac gcg gag egg egg egg ttc etc gcc gcg cag ctg 240 
Asp Ala Asp Leu Asp Ala Glu Arg Arg Arg Phe Leu Ala Ala Gin Leu 
65 70 75 80 
acc gcg gtg gag tgc gcg gcc cgg cgg gcg gcg ggt gag cag atc ggc 288
Thr Ala Val Glu Cys Ala Ala Arg Arg Ala Ala Gly Glu Gln Ile Gly
85 90 95 

ttc ctg gcc gag gtg gag acc tac ttc gac gtc gag gtg cgc etc ggc 336 
Phe Leu Ala Glu Val Glu Thr Tyr Phe Asp Val Glu Val Arg Leu Gly 
100 105 110 

gac ccg gac egg tac gcc gcc gcg cac gac gcc ate gac gcg ctg ctg 384 
Asp Pro Asp Arg Tyr Ala Ala Ala His Asp Ala lie Asp Ala Leu Leu 
115 120 125 

ccg ggc acc ggc ccg ctg atg gac aag gtc gag gcg ttc tac gcc cgc 432
Pro Gly Thr Gly Pro Leu Met Asp Lys Val Glu Ala Phe Tyr Ala Arg 
130 135 140 

aac gtg gtg ccg ccg gag egg ctg ggc cac gcc gtg egg gcc gtc gcc 48 0 
Asn Val Val Pro Pro Glu Arg Leu Gly His Ala Val Arg Ala Val Ala 
145 150 155 160 

gac gcg ctg cgc gcc cgt gcc egg ccg atg etc ggg ctg ccc gag gcc 528 
Asp Ala Leu Arg Ala Arg Ala Arg Pro Met Leu Gly Leu Pro Glu Ala 
165 170 175 

gag egg gtc gac ate gag gtg gtc cgc gac egg ccg tgg aac gcg ttc 576 
Glu Arg Val Asp lie Glu Val Val Arg Asp Arg Pro Trp Asn Ala Phe 
180 185 190 

aac egg tac cac ggc ggc ttc cgt tec acg gtg acg ctg aac gag acg 624 
Asn Arg Tyr His Gly Gly Phe Arg Ser Thr Val Thr Leu Asn Glu Thr 
195 200 205 

gcc ggc egg acc ate gcc gtg ctg ccg ctg atg gcc acc cac gag gcg 672 
Ala Gly Arg Thr lie Ala Val Leu Pro Leu Met Ala Thr His Glu Ala 
210 215 220 

tac ccg ggc cac cac acc gag cac tgc etc aag gag gcc ggg ctg gtg 72 0 
Tyr Pro Gly His His Thr Glu His Cys Leu Lys Glu Ala Gly Leu Val 
225 230 235 240 

etc gac egg ggc tgg gac gag cac cgc ate gcc ctg gtc aac acc ccg 768 
Leu Asp Arg Gly Trp Asp Glu His Arg lie Ala Leu Val Asn Thr Pro 
245 250 255 

cag tgc ctg gtg gcg gag ggc acc gcc gag cac gcc gcg gcg gcg ctg 816 
Gin Cys Leu Val Ala Glu Gly Thr Ala Glu His Ala Ala Ala Ala Leu 
260 265 270 

etc ggg ccc ggc tgg gga egg tgg acc acc gag gtg ctg gcc ggc gag 864 
Leu Gly Pro Gly Trp Gly Arg Trp Thr Thr Glu Val Leu Ala Gly Glu 
275 280 285 

ggg gtg ccc gtc gag ggc gac ctc gtc gag cgg atg gtg ggg ctc gtc 912
Gly Val Pro Val Glu Gly Asp Leu Val Glu Arg Met Val Gly Leu Val 
290 295 300 

aac gag ctg atg ccg gcc egg cag gac gcg gcg ate ctg ctg cac gac 960 
Asn Glu Leu Met Pro Ala Arg Gin Asp Ala Ala lie Leu Leu His Asp 
305 310 315 320

egg ggg gcg teg ate gac gac gcg gtg gag cac ctg cac egg tgg ctg 1008 
Arg Gly Ala Ser lie Asp Asp Ala Val Glu His Leu His Arg Trp Leu 
325 330 335 

ctg ctg ccg egg gac egg gec gag cag ate gee ace ttc ctg ace gac 1056 
Leu Leu Pro Arg Asp Arg Ala Glu Gin lie Ala Thr Phe Leu Thr Asp 
340 345 350 

ccg ctg tgg egg gee tac tec gtg acg tac ate gag ggg gec egg ctg 1104 
Pro Leu Trp Arg Ala Tyr Ser Val Thr Tyr lie Glu Gly Ala Arg Leu 
355 360 365 

gtc ggc ggg tgg etc gee gec egg ccg gec ggc gag ccg etc gtc gcg 1152 
Val Gly Gly Trp Leu Ala Ala Arg Pro Ala Gly Glu Pro Leu Val Ala 
370 375 380 

egg tac cgc ace ctg ctg gcg gag cag etc ctt ccc gcg cag etc cgc 1200 
Arg Tyr Arg Thr Leu Leu Ala Glu Gin Leu Leu Pro Ala Gin Leu Arg 
385 390 395 400 

gac ggc acg gtc ccc gcg ggc gcg ccg ccc gtg ccc gcg gec cgc tga 1248 
Asp Gly Thr Val Pro Ala Gly Ala Pro Pro Val Pro Ala Ala Arg * 
405 410 415 



<210> 24 
<211> 415 
<212> PRT 
<213> Bacteria 



<400> 24 
Val Thr Gly Ala Ala Ala Asp Ala Val Val Ala Asp Tyr Leu Ala Leu
1 5 10 15

Gly Leu Arg Met Gly Arg Leu Val Glu Gly Tyr Val Asp Cys Trp Phe
20 25 30

Gly Asp Arg Ala Leu Ala Glu Arg Val Ala Ala Glu Pro Ala Pro Asp
35 40 45

Pro Ala Glu Leu Ala Gly Gln Ala Arg Asp Leu Leu Arg Arg Leu Gly
50 55 60

Asp Ala Asp Leu Asp Ala Glu Arg Arg Arg Phe Leu Ala Ala Gln Leu
65 70 75 80

Thr Ala Val Glu Cys Ala Ala Arg Arg Ala Ala Gly Glu Gln Ile Gly
85 90 95

Phe Leu Ala Glu Val Glu Thr Tyr Phe Asp Val Glu Val Arg Leu Gly
100 105 110

Asp Pro Asp Arg Tyr Ala Ala Ala His Asp Ala Ile Asp Ala Leu Leu
115 120 125

Pro Gly Thr Gly Pro Leu Met Asp Lys Val Glu Ala Phe Tyr Ala Arg
130 135 140

Asn Val Val Pro Pro Glu Arg Leu Gly His Ala Val Arg Ala Val Ala
145 150 155 160

Asp Ala Leu Arg Ala Arg Ala Arg Pro Met Leu Gly Leu Pro Glu Ala
165 170 175

Glu Arg Val Asp Ile Glu Val Val Arg Asp Arg Pro Trp Asn Ala Phe
180 185 190

Asn Arg Tyr His Gly Gly Phe Arg Ser Thr Val Thr Leu Asn Glu Thr
195 200 205

Ala Gly Arg Thr Ile Ala Val Leu Pro Leu Met Ala Thr His Glu Ala
210 215 220

Tyr Pro Gly His His Thr Glu His Cys Leu Lys Glu Ala Gly Leu Val
225 230 235 240

Leu Asp Arg Gly Trp Asp Glu His Arg Ile Ala Leu Val Asn Thr Pro
245 250 255

Gln Cys Leu Val Ala Glu Gly Thr Ala Glu His Ala Ala Ala Ala Leu
260 265 270

Leu Gly Pro Gly Trp Gly Arg Trp Thr Thr Glu Val Leu Ala Gly Glu
275 280 285

Gly Val Pro Val Glu Gly Asp Leu Val Glu Arg Met Val Gly Leu Val
290 295 300

Asn Glu Leu Met Pro Ala Arg Gln Asp Ala Ala Ile Leu Leu His Asp
305 310 315 320

Arg Gly Ala Ser Ile Asp Asp Ala Val Glu His Leu His Arg Trp Leu
325 330 335

Leu Leu Pro Arg Asp Arg Ala Glu Gln Ile Ala Thr Phe Leu Thr Asp
340 345 350

Pro Leu Trp Arg Ala Tyr Ser Val Thr Tyr Ile Glu Gly Ala Arg Leu
355 360 365

Val Gly Gly Trp Leu Ala Ala Arg Pro Ala Gly Glu Pro Leu Val Ala
370 375 380

Arg Tyr Arg Thr Leu Leu Ala Glu Gln Leu Leu Pro Ala Gln Leu Arg
385 390 395 400

Asp Gly Thr Val Pro Ala Gly Ala Pro Pro Val Pro Ala Ala Arg
405 410 415



<210> 25 
<211> 1194 
<212> DNA 
<213> Bacteria 



<220> 

<221> CDS 

<222> (1)...(1194)



<400> 25 

atg gcc cac ctc ctg atc gtc aac gtc gcc agc cac ggc ctg atc ctg 48
Met Ala His Leu Leu Ile Val Asn Val Ala Ser His Gly Leu Ile Leu
1 5 10 15

ccc acc ctc acc gtg gtc acc gag ctg gtc cgg cgc ggg cac cgg gtc 96
Pro Thr Leu Thr Val Val Thr Glu Leu Val Arg Arg Gly His Arg Val
20 25 30

agc tac gtc acc gcc ggc ggg ttc gcg gag ccg gtc cgt gcc gcc ggc 144
Ser Tyr Val Thr Ala Gly Gly Phe Ala Glu Pro Val Arg Ala Ala Gly
35 40 45

gcg acg gtg gtg ccc tac cag tcg gag atc atc gac gcg gac gcc gcc 192
Ala Thr Val Val Pro Tyr Gln Ser Glu Ile Ile Asp Ala Asp Ala Ala
50 55 60

gag gtg ttc ggc tcg gac gac ctc ggc gtc cgt ccc cac ctg atg tac 240
Glu Val Phe Gly Ser Asp Asp Leu Gly Val Arg Pro His Leu Met Tyr
65 70 75 80

ctg cgg gag aac gtc tcg gtg ctc cgg gcc acc gcc gag gcg ctc gac 288
Leu Arg Glu Asn Val Ser Val Leu Arg Ala Thr Ala Glu Ala Leu Asp
85 90 95

ggc gac gtg ccg gac ctg gtc ctc tac gac gac ttc ccg ttc atc gcc 336
Gly Asp Val Pro Asp Leu Val Leu Tyr Asp Asp Phe Pro Phe Ile Ala
100 105 110

ggg cag ttg ctg gcc gcc cgc tgg cgg cgg ccg gcc gtc cgg ctc agc 384
Gly Gln Leu Leu Ala Ala Arg Trp Arg Arg Pro Ala Val Arg Leu Ser
115 120 125

gcg gcg ttc gcg tcg aac gag cac tac tcg ttc tcc cag gac atg gtc 432
Ala Ala Phe Ala Ser Asn Glu His Tyr Ser Phe Ser Gln Asp Met Val
130 135 140

acc ctg gcc ggc acg atc gac ccg ctc gac ctg ccg gtg ttc cgc gac 480
Thr Leu Ala Gly Thr Ile Asp Pro Leu Asp Leu Pro Val Phe Arg Asp
145 150 155 160

acc ctg cgg gac ctg ctc gcc gag cac ggc ctg tcc cgg tcg gtc gtg 528
Thr Leu Arg Asp Leu Leu Ala Glu His Gly Leu Ser Arg Ser Val Val
165 170 175

gac tgc tgg aac cac gtg gag caa ctc aac ctg gtc ttc gtc ccg aag 576
Asp Cys Trp Asn His Val Glu Gln Leu Asn Leu Val Phe Val Pro Lys
180 185 190

gcg ttc cag atc gcc ggc gac acc ttc gac gac cgc ttc gtc ttc gtc 624
Ala Phe Gln Ile Ala Gly Asp Thr Phe Asp Asp Arg Phe Val Phe Val
195 200 205

ggg ccg tgc ttc gac gac cgg cgg ttc ctc ggc gag tgg acc cgc ccg 672
Gly Pro Cys Phe Asp Asp Arg Arg Phe Leu Gly Glu Trp Thr Arg Pro
210 215 220

gcc gac gac ctg ccg gtg gtg ctg gtg tcg ctc ggc acc acc ttc aac 720
Ala Asp Asp Leu Pro Val Val Leu Val Ser Leu Gly Thr Thr Phe Asn
225 230 235 240

gac cgg ccc gga ttc ttc cgc gac tgc gcg cgg gcg ttc gac ggc cag 768
Asp Arg Pro Gly Phe Phe Arg Asp Cys Ala Arg Ala Phe Asp Gly Gln
245 250 255

ccg tgg cac gtg gtg atg acg ctg ggc ggc cag gtc gac ccg gcg gct 816
Pro Trp His Val Val Met Thr Leu Gly Gly Gln Val Asp Pro Ala Ala
260 265 270

ctc ggc gac ctg ccc ccc aac gtg gag gcg cac cgc tgg gtc ccg cac 864
Leu Gly Asp Leu Pro Pro Asn Val Glu Ala His Arg Trp Val Pro His
275 280 285

gtg aag gtg ctc gaa cag gcg acg gtc tgc gtg acg cac ggc ggc atg 912
Val Lys Val Leu Glu Gln Ala Thr Val Cys Val Thr His Gly Gly Met
290 295 300

ggc acc ctc atg gag gcg ctc tac tgg ggg cgc ccg ctg gtg gtc gtg 960
Gly Thr Leu Met Glu Ala Leu Tyr Trp Gly Arg Pro Leu Val Val Val
305 310 315 320

ccg cag tcc ttc gac gtg cag ccg atg gcc cgg cgg gtc gac cag ctc 1008
Pro Gln Ser Phe Asp Val Gln Pro Met Ala Arg Arg Val Asp Gln Leu
325 330 335

ggc ctc ggc gcg gtg ctg ccc ggg gag aag gcc gac ggc gac acg ctg 1056
Gly Leu Gly Ala Val Leu Pro Gly Glu Lys Ala Asp Gly Asp Thr Leu
340 345 350

ctc gcc gcc gtc ggg gcc gtc gcg gcc gac ccc gcg ctg ctc gcc cgg 1104
Leu Ala Ala Val Gly Ala Val Ala Ala Asp Pro Ala Leu Leu Ala Arg
355 360 365

gtc gag gcc atg cgg ggc cac gtc cgc cgg gcc ggg ggc gcg gcc cgg 1152
Val Glu Ala Met Arg Gly His Val Arg Arg Ala Gly Gly Ala Ala Arg
370 375 380

gcc gcc gac gcc gtg gag gcg tac ctg gcc cgc gcc cgc tga 1194
Ala Ala Asp Ala Val Glu Ala Tyr Leu Ala Arg Ala Arg *
385 390 395



<210> 26 
<211> 397 
<212> PRT 
<213> Bacteria 

<400> 26 
Met Ala His Leu Leu Ile Val Asn Val Ala Ser His Gly Leu Ile Leu
1 5 10 15

Pro Thr Leu Thr Val Val Thr Glu Leu Val Arg Arg Gly His Arg Val
20 25 30

Ser Tyr Val Thr Ala Gly Gly Phe Ala Glu Pro Val Arg Ala Ala Gly
35 40 45

Ala Thr Val Val Pro Tyr Gln Ser Glu Ile Ile Asp Ala Asp Ala Ala
50 55 60

Glu Val Phe Gly Ser Asp Asp Leu Gly Val Arg Pro His Leu Met Tyr
65 70 75 80

Leu Arg Glu Asn Val Ser Val Leu Arg Ala Thr Ala Glu Ala Leu Asp
85 90 95

Gly Asp Val Pro Asp Leu Val Leu Tyr Asp Asp Phe Pro Phe Ile Ala
100 105 110

Gly Gln Leu Leu Ala Ala Arg Trp Arg Arg Pro Ala Val Arg Leu Ser
115 120 125

Ala Ala Phe Ala Ser Asn Glu His Tyr Ser Phe Ser Gln Asp Met Val
130 135 140

Thr Leu Ala Gly Thr Ile Asp Pro Leu Asp Leu Pro Val Phe Arg Asp
145 150 155 160

Thr Leu Arg Asp Leu Leu Ala Glu His Gly Leu Ser Arg Ser Val Val
165 170 175

Asp Cys Trp Asn His Val Glu Gln Leu Asn Leu Val Phe Val Pro Lys
180 185 190

Ala Phe Gln Ile Ala Gly Asp Thr Phe Asp Asp Arg Phe Val Phe Val
195 200 205

Gly Pro Cys Phe Asp Asp Arg Arg Phe Leu Gly Glu Trp Thr Arg Pro
210 215 220

Ala Asp Asp Leu Pro Val Val Leu Val Ser Leu Gly Thr Thr Phe Asn
225 230 235 240

Asp Arg Pro Gly Phe Phe Arg Asp Cys Ala Arg Ala Phe Asp Gly Gln
245 250 255

Pro Trp His Val Val Met Thr Leu Gly Gly Gln Val Asp Pro Ala Ala
260 265 270

Leu Gly Asp Leu Pro Pro Asn Val Glu Ala His Arg Trp Val Pro His
275 280 285

Val Lys Val Leu Glu Gln Ala Thr Val Cys Val Thr His Gly Gly Met
290 295 300

Gly Thr Leu Met Glu Ala Leu Tyr Trp Gly Arg Pro Leu Val Val Val
305 310 315 320

Pro Gln Ser Phe Asp Val Gln Pro Met Ala Arg Arg Val Asp Gln Leu
325 330 335

Gly Leu Gly Ala Val Leu Pro Gly Glu Lys Ala Asp Gly Asp Thr Leu
340 345 350

Leu Ala Ala Val Gly Ala Val Ala Ala Asp Pro Ala Leu Leu Ala Arg
355 360 365

Val Glu Ala Met Arg Gly His Val Arg Arg Ala Gly Gly Ala Ala Arg
370 375 380

Ala Ala Asp Ala Val Glu Ala Tyr Leu Ala Arg Ala Arg
385 390 395



<210> 27 
<211> 993 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1)...(993)

<400> 27 

gtg tcg tcg ctg cat gtc cgg ctc gga cgg acc ggc ctg cgg gtc agc 48
Val Ser Ser Leu His Val Arg Leu Gly Arg Thr Gly Leu Arg Val Ser
1 5 10 15

cgg gtc gcc atc ggg acc gtc aac ttc ggc ggc cgg gtc gac gag gcc 96
Arg Val Ala Ile Gly Thr Val Asn Phe Gly Gly Arg Val Asp Glu Ala
20 25 30

gac gcc cac cgg ctg ctc gac cac gcc gtc gcg cag ggg gtc aac ctg 144
Asp Ala His Arg Leu Leu Asp His Ala Val Ala Gln Gly Val Asn Leu
35 40 45

gtc gac acc gcc gac atc tac ggc tgg cgg gtg cac cgg ggc tgg acc 192
Val Asp Thr Ala Asp Ile Tyr Gly Trp Arg Val His Arg Gly Trp Thr
50 55 60

gag gag atg atc ggg cgc tgg ctc gcc aag gac ccg gcc cgg cgg gac 240
Glu Glu Met Ile Gly Arg Trp Leu Ala Lys Asp Pro Ala Arg Arg Asp
65 70 75 80

gag gtg gtc ctc gcg acc aag gtc ggc aat ccc atg ggg gac ggc ccc 288
Glu Val Val Leu Ala Thr Lys Val Gly Asn Pro Met Gly Asp Gly Pro
85 90 95

aac gcc cgg ggc ctg tcg gcc cga cac gtc gtc gcc gcc tgc gag gcg 336
Asn Ala Arg Gly Leu Ser Ala Arg His Val Val Ala Ala Cys Glu Ala
100 105 110

tcg ctg cgc cgg ctc cag acc gac gcc atc gac ctc tac cag atg cac 384
Ser Leu Arg Arg Leu Gln Thr Asp Ala Ile Asp Leu Tyr Gln Met His
115 120 125

cac gtc gac cgg gag gtc ggc tgg gac gag atc tgg cag gcc atg gag 432
His Val Asp Arg Glu Val Gly Trp Asp Glu Ile Trp Gln Ala Met Glu
130 135 140

cag ctc gtc cgg cag ggc aag gtc cgc tac gtc ggg tcc tcg aac ttc 480
Gln Leu Val Arg Gln Gly Lys Val Arg Tyr Val Gly Ser Ser Asn Phe
145 150 155 160

gcc ggc tgg gac ctg gtg agc gcc cag gag gcc gcg cgc cgg cac cgg 528
Ala Gly Trp Asp Leu Val Ser Ala Gln Glu Ala Ala Arg Arg His Arg
165 170 175

ctg ctc ggg ctg gcc agc gag cag tgc gtc tac aac ctg gtc agc cgg 576
Leu Leu Gly Leu Ala Ser Glu Gln Cys Val Tyr Asn Leu Val Ser Arg
180 185 190

tac gtc gaa ctg gag gtg ctc ccc gcc gcc gtc gcc gag ggc atc ggg 624
Tyr Val Glu Leu Glu Val Leu Pro Ala Ala Val Ala Glu Gly Ile Gly
195 200 205

gtg ctc gtc tgg tcg ccg ctg cac ggc ggg ctg ctc ggc ggc gtg ctg 672
Val Leu Val Trp Ser Pro Leu His Gly Gly Leu Leu Gly Gly Val Leu
210 215 220

cgg aag ctg gcc gac ggc acc gcg gtc aag tcc gcg cag gga cgg gcc 720
Arg Lys Leu Ala Asp Gly Thr Ala Val Lys Ser Ala Gln Gly Arg Ala
225 230 235 240

gcc gag gcg gtc gag cgg cac cgc gcg aca ctc gcc gcg tac gag acg 768
Ala Glu Ala Val Glu Arg His Arg Ala Thr Leu Ala Ala Tyr Glu Thr
245 250 255

ttc tgc gcc gag gcc ggc cgc gac ccg gcg gag gtc ggc atg gcc tgg 816
Phe Cys Ala Glu Ala Gly Arg Asp Pro Ala Glu Val Gly Met Ala Trp
260 265 270

gtg ctg cac cgc ccg gcg gtg acc gcc gcg gtc gtc ggt ccg cgt acc 864
Val Leu His Arg Pro Ala Val Thr Ala Ala Val Val Gly Pro Arg Thr
275 280 285

ccc gaa cac ctg gac ggc gcc ctg cgg gcc ctg cac cgg ccg ctg tcg 912
Pro Glu His Leu Asp Gly Ala Leu Arg Ala Leu His Arg Pro Leu Ser
290 295 300

gcg gcg gag ctc gcc cgg ctc gac gag ctg ttc ccg ccg ctc ggc cgg 960
Ala Ala Glu Leu Ala Arg Leu Asp Glu Leu Phe Pro Pro Leu Gly Arg
305 310 315 320






ggc ggc gcc gcc ccg gac gcc tgg atg tcc tga 993
Gly Gly Ala Ala Pro Asp Ala Trp Met Ser *
325 330



<210> 28 
<211> 330 
<212> PRT 
<213> Bacteria 



<400> 28 
Val Ser Ser Leu His Val Arg Leu Gly Arg Thr Gly Leu Arg Val Ser
1 5 10 15

Arg Val Ala Ile Gly Thr Val Asn Phe Gly Gly Arg Val Asp Glu Ala
20 25 30

Asp Ala His Arg Leu Leu Asp His Ala Val Ala Gln Gly Val Asn Leu
35 40 45

Val Asp Thr Ala Asp Ile Tyr Gly Trp Arg Val His Arg Gly Trp Thr
50 55 60

Glu Glu Met Ile Gly Arg Trp Leu Ala Lys Asp Pro Ala Arg Arg Asp
65 70 75 80

Glu Val Val Leu Ala Thr Lys Val Gly Asn Pro Met Gly Asp Gly Pro
85 90 95

Asn Ala Arg Gly Leu Ser Ala Arg His Val Val Ala Ala Cys Glu Ala
100 105 110

Ser Leu Arg Arg Leu Gln Thr Asp Ala Ile Asp Leu Tyr Gln Met His
115 120 125

His Val Asp Arg Glu Val Gly Trp Asp Glu Ile Trp Gln Ala Met Glu
130 135 140

Gln Leu Val Arg Gln Gly Lys Val Arg Tyr Val Gly Ser Ser Asn Phe
145 150 155 160

Ala Gly Trp Asp Leu Val Ser Ala Gln Glu Ala Ala Arg Arg His Arg
165 170 175

Leu Leu Gly Leu Ala Ser Glu Gln Cys Val Tyr Asn Leu Val Ser Arg
180 185 190

Tyr Val Glu Leu Glu Val Leu Pro Ala Ala Val Ala Glu Gly Ile Gly
195 200 205

Val Leu Val Trp Ser Pro Leu His Gly Gly Leu Leu Gly Gly Val Leu
210 215 220

Arg Lys Leu Ala Asp Gly Thr Ala Val Lys Ser Ala Gln Gly Arg Ala
225 230 235 240

Ala Glu Ala Val Glu Arg His Arg Ala Thr Leu Ala Ala Tyr Glu Thr
245 250 255

Phe Cys Ala Glu Ala Gly Arg Asp Pro Ala Glu Val Gly Met Ala Trp
260 265 270

Val Leu His Arg Pro Ala Val Thr Ala Ala Val Val Gly Pro Arg Thr
275 280 285

Pro Glu His Leu Asp Gly Ala Leu Arg Ala Leu His Arg Pro Leu Ser
290 295 300

Ala Ala Glu Leu Ala Arg Leu Asp Glu Leu Phe Pro Pro Leu Gly Arg
305 310 315 320

Gly Gly Ala Ala Pro Asp Ala Trp Met Ser
325 330



<210> 29 
<211> 543 
<212> DNA 
<213> Bacteria



<220> 

<221> CDS 

<222> (1)...(543)

<400> 29 

gga tcc cgg cag gga tat ggg agg atc gcc cac cac aac atc cac ttt 48
Gly Ser Arg Gln Gly Tyr Gly Arg Ile Ala His His Asn Ile His Phe
1 5 10 15

gga cgg tcc tgg aag ggc acc ttc gat gag gtc atc cgg cgt ggc gag 96
Gly Arg Ser Trp Lys Gly Thr Phe Asp Glu Val Ile Arg Arg Gly Glu
20 25 30

ctg atg agc gac ccg tcc ctg ctg gtg acc aac ccg agc cgg acg gac 144
Leu Met Ser Asp Pro Ser Leu Leu Val Thr Asn Pro Ser Arg Thr Asp
35 40 45

ccg tcc gtg gcg ccc gcc ggc cgg cac acc tac tac gtg ctc gcg ccg 192
Pro Ser Val Ala Pro Ala Gly Arg His Thr Tyr Tyr Val Leu Ala Pro
50 55 60

gtg ccc aac ctc cac cgg gcg ccg ttc gac tgg cgg ggc gac ctc acc 240
Val Pro Asn Leu His Arg Ala Pro Phe Asp Trp Arg Gly Asp Leu Thr
65 70 75 80

gac cgc tac gcc gac cag ctc gtc ggg acc ctg gag gag cgc ggc tac 288
Asp Arg Tyr Ala Asp Gln Leu Val Gly Thr Leu Glu Glu Arg Gly Tyr
85 90 95

gtc ggc ttc ggc gcc ggc gtc gag gtg ctg cgg gcg gtc acc ccg gcc 336
Val Gly Phe Gly Ala Gly Val Glu Val Leu Arg Ala Val Thr Pro Ala
100 105 110

gag tgg gcg gag cag ggg atg gcc gcc ggc acc ccg ttc gcc gcc gcg 384
Glu Trp Ala Glu Gln Gly Met Ala Ala Gly Thr Pro Phe Ala Ala Ala
115 120 125

cac agc ttc ttc cag acc ggc ccg ttc cgc ccg tcg aac ctg cac cgg 432
His Ser Phe Phe Gln Thr Gly Pro Phe Arg Pro Ser Asn Leu His Arg
130 135 140

acg ctg ccg aac gtg gtc ttc gtc ggc tcc ggc acc cag ccc ggt gtc 480
Thr Leu Pro Asn Val Val Phe Val Gly Ser Gly Thr Gln Pro Gly Val
145 150 155 160

ggc gtg ccg atg gtg ctc atc tcc ggc aag ctc gcc gcc ggc cgc atc 528
Gly Val Pro Met Val Leu Ile Ser Gly Lys Leu Ala Ala Gly Arg Ile
165 170 175

acc ggg aga tcc tga 543
Thr Gly Arg Ser *
180



<210> 30 
<211> 180 
<212> PRT
<213> Bacteria 



<400> 30 

Gly Ser Arg Gln Gly Tyr Gly Arg Ile Ala His His Asn Ile His Phe
1 5 10 15

Gly Arg Ser Trp Lys Gly Thr Phe Asp Glu Val Ile Arg Arg Gly Glu
20 25 30

Leu Met Ser Asp Pro Ser Leu Leu Val Thr Asn Pro Ser Arg Thr Asp
35 40 45

Pro Ser Val Ala Pro Ala Gly Arg His Thr Tyr Tyr Val Leu Ala Pro
50 55 60

Val Pro Asn Leu His Arg Ala Pro Phe Asp Trp Arg Gly Asp Leu Thr
65 70 75 80

Asp Arg Tyr Ala Asp Gln Leu Val Gly Thr Leu Glu Glu Arg Gly Tyr
85 90 95

Val Gly Phe Gly Ala Gly Val Glu Val Leu Arg Ala Val Thr Pro Ala
100 105 110

Glu Trp Ala Glu Gln Gly Met Ala Ala Gly Thr Pro Phe Ala Ala Ala
115 120 125

His Ser Phe Phe Gln Thr Gly Pro Phe Arg Pro Ser Asn Leu His Arg
130 135 140

Thr Leu Pro Asn Val Val Phe Val Gly Ser Gly Thr Gln Pro Gly Val
145 150 155 160

Gly Val Pro Met Val Leu Ile Ser Gly Lys Leu Ala Ala Gly Arg Ile
165 170 175

Thr Gly Arg Ser
180



<210> 31 
<211> 1362 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1)...(1362)

<400> 31 

atg ccg ttc ctt ccc gac ccg ggc gaa ccg tcc ccg ctg aag gtg gtc 48
Met Pro Phe Leu Pro Asp Pro Gly Glu Pro Ser Pro Leu Lys Val Val
1 5 10 15

atc gcc ggc gcc ggc tac gtc ggc acc tgt ctc gcc gtc acc ctc gcc 96
Ile Ala Gly Ala Gly Tyr Val Gly Thr Cys Leu Ala Val Thr Leu Ala
20 25 30

ggc cgc ggc gcc gag gtg gtc gcg gtc gac agc gac ccg ggc acc gtc 144
Gly Arg Gly Ala Glu Val Val Ala Val Asp Ser Asp Pro Gly Thr Val
35 40 45

gcg gac ctg cgg gcc ggc cgg tgc cgg ctg ccc gag ccc ggc ctg gcc 192
Ala Asp Leu Arg Ala Gly Arg Cys Arg Leu Pro Glu Pro Gly Leu Ala
50 55 60

ggc gcc gtc cgg gac ctc gcc gcg acc gga cgg ctg acg gcg agc acg 240
Gly Ala Val Arg Asp Leu Ala Ala Thr Gly Arg Leu Thr Ala Ser Thr
65 70 75 80

tcg tac gac ccg gtc ggc gcg gcg gac gtg gtg atc gtg acg gtc ggc 288
Ser Tyr Asp Pro Val Gly Ala Ala Asp Val Val Ile Val Thr Val Gly
85 90 95

acc ccg acc gac gcc ggc cac gag atg gtc acc gac cag ctc gtc gcg 336
Thr Pro Thr Asp Ala Gly His Glu Met Val Thr Asp Gln Leu Val Ala
100 105 110

gcg tgc gag cag atc gcc ccg cgg ctg cgc gcc ggg caa ctg gtg atc 384
Ala Cys Glu Gln Ile Ala Pro Arg Leu Arg Ala Gly Gln Leu Val Ile
115 120 125

ctc aag tcg acg gtc tcc ccg ggc acc acc cgg acc ctc gtc gcg ccc 432
Leu Lys Ser Thr Val Ser Pro Gly Thr Thr Arg Thr Leu Val Ala Pro
130 135 140

ctg ctg gag agc ggc ggg ctg gtg cac gag cgc gac ttc ggg ctg gcc 480
Leu Leu Glu Ser Gly Gly Leu Val His Glu Arg Asp Phe Gly Leu Ala
145 150 155 160

ttc tgc ccg gag cgg ctc gcc gag ggg gtg gcg ctg gcg cag gtg cgg 528
Phe Cys Pro Glu Arg Leu Ala Glu Gly Val Ala Leu Ala Gln Val Arg
165 170 175

acg ctg ccg gtg gtg gtg ggt ggg tgc ggc ccg cgc agc gcc gcc gcg 576
Thr Leu Pro Val Val Val Gly Gly Cys Gly Pro Arg Ser Ala Ala Ala
180 185 190

gcc gaa cgg ttc tgg cgg tcc gcg ctc ggc gtc gac gtc cgg cag gtg 624
Ala Glu Arg Phe Trp Arg Ser Ala Leu Gly Val Asp Val Arg Gln Val
195 200 205

ccg tcg gcc gag tcc gcc gag gtg gtc aag ctc gcg acc aac tgg tgg 672
Pro Ser Ala Glu Ser Ala Glu Val Val Lys Leu Ala Thr Asn Trp Trp
210 215 220

atc gac gcg aac gtg gcg atc gcc aac gaa ctc gcc cgg tac tgc gcg 720
Ile Asp Ala Asn Val Ala Ile Ala Asn Glu Leu Ala Arg Tyr Cys Ala
225 230 235 240

gtg ctg ggg gtg gac gtc ctc gac gtg atc ggc gcg gcg aac acc ctg 768
Val Leu Gly Val Asp Val Leu Asp Val Ile Gly Ala Ala Asn Thr Leu
245 250 255

ccc aag ggc agc agc atg gtg aac ctg ctg ctg ccg ggg gtg ggt gtc 816
Pro Lys Gly Ser Ser Met Val Asn Leu Leu Leu Pro Gly Val Gly Val
260 265 270

ggc ggc tcc tgc ctg acg aag gac ccg tgg atg gcg tgg cgg gac ggc 864
Gly Gly Ser Cys Leu Thr Lys Asp Pro Trp Met Ala Trp Arg Asp Gly
275 280 285

cgg gac cgg ggc gtg ccc ctg cgc acg gtc gag acg gcc cgc gcg gtc 912
Arg Asp Arg Gly Val Pro Leu Arg Thr Val Glu Thr Ala Arg Ala Val
290 295 300

aac gac gac atg ccc cgc cac acc gcc gcc gtc atc gcc gac gag ctg 960
Asn Asp Asp Met Pro Arg His Thr Ala Ala Val Ile Ala Asp Glu Leu
305 310 315 320

gtc aag ctg gga cgg gat cgg aac gac acg acg atc gcc gtg ctc ggc 1008
Val Lys Leu Gly Arg Asp Arg Asn Asp Thr Thr Ile Ala Val Leu Gly
325 330 335

gcg gcg ttc aag aac gac acc ggc gac gtc cgc aac acc ccg gtg cgc 1056
Ala Ala Phe Lys Asn Asp Thr Gly Asp Val Arg Asn Thr Pro Val Arg
340 345 350

ggg gtc gtg gcg gcg ctg cgc gac agc ggc ttc cgg gtc cgg atc ttc 1104
Gly Val Val Ala Ala Leu Arg Asp Ser Gly Phe Arg Val Arg Ile Phe
355 360 365

gac ccg ctg gcc gat ccc gcc gag atc gtc gcc cgg ttc ggc acc gcg 1152
Asp Pro Leu Ala Asp Pro Ala Glu Ile Val Ala Arg Phe Gly Thr Ala
370 375 380

ccg gcg gcg agc ctg gac gag gcg gtg agc ggg gcg ggc tgc ctg gcc 1200
Pro Ala Ala Ser Leu Asp Glu Ala Val Ser Gly Ala Gly Cys Leu Ala
385 390 395 400

ttc ctc gcc ggg cac cgc cag ttc cac gag ctc gac ttc ggc gcc ctg 1248
Phe Leu Ala Gly His Arg Gln Phe His Glu Leu Asp Phe Gly Ala Leu
405 410 415

gcc gag cgg gtg gac gag ccc tgc ctg gtc ttc gac ggc cgc atg cac 1296
Ala Glu Arg Val Asp Glu Pro Cys Leu Val Phe Asp Gly Arg Met His
420 425 430

ctc ccg ccg gcg cgc atc cgc gag ctg cac cgg ttc ggc ttc gcc tac 1344
Leu Pro Pro Ala Arg Ile Arg Glu Leu His Arg Phe Gly Phe Ala Tyr
435 440 445

cgc ggc att gga agg tga 1362
Arg Gly Ile Gly Arg *
450



<210> 32 
<211> 453 
<212> PRT 
<213> Bacteria 



<400> 32 

Met Pro Phe Leu Pro Asp Pro Gly Glu Pro Ser Pro Leu Lys Val Val
1 5 10 15

Ile Ala Gly Ala Gly Tyr Val Gly Thr Cys Leu Ala Val Thr Leu Ala
20 25 30

Gly Arg Gly Ala Glu Val Val Ala Val Asp Ser Asp Pro Gly Thr Val
35 40 45

Ala Asp Leu Arg Ala Gly Arg Cys Arg Leu Pro Glu Pro Gly Leu Ala
50 55 60

Gly Ala Val Arg Asp Leu Ala Ala Thr Gly Arg Leu Thr Ala Ser Thr
65 70 75 80

Ser Tyr Asp Pro Val Gly Ala Ala Asp Val Val Ile Val Thr Val Gly
85 90 95

Thr Pro Thr Asp Ala Gly His Glu Met Val Thr Asp Gln Leu Val Ala
100 105 110

Ala Cys Glu Gln Ile Ala Pro Arg Leu Arg Ala Gly Gln Leu Val Ile
115 120 125

Leu Lys Ser Thr Val Ser Pro Gly Thr Thr Arg Thr Leu Val Ala Pro
130 135 140

Leu Leu Glu Ser Gly Gly Leu Val His Glu Arg Asp Phe Gly Leu Ala
145 150 155 160

Phe Cys Pro Glu Arg Leu Ala Glu Gly Val Ala Leu Ala Gln Val Arg
165 170 175

Thr Leu Pro Val Val Val Gly Gly Cys Gly Pro Arg Ser Ala Ala Ala
180 185 190

Ala Glu Arg Phe Trp Arg Ser Ala Leu Gly Val Asp Val Arg Gln Val
195 200 205

Pro Ser Ala Glu Ser Ala Glu Val Val Lys Leu Ala Thr Asn Trp Trp
210 215 220

Ile Asp Ala Asn Val Ala Ile Ala Asn Glu Leu Ala Arg Tyr Cys Ala
225 230 235 240

Val Leu Gly Val Asp Val Leu Asp Val Ile Gly Ala Ala Asn Thr Leu
245 250 255

Pro Lys Gly Ser Ser Met Val Asn Leu Leu Leu Pro Gly Val Gly Val
260 265 270

Gly Gly Ser Cys Leu Thr Lys Asp Pro Trp Met Ala Trp Arg Asp Gly
275 280 285

Arg Asp Arg Gly Val Pro Leu Arg Thr Val Glu Thr Ala Arg Ala Val
290 295 300

Asn Asp Asp Met Pro Arg His Thr Ala Ala Val Ile Ala Asp Glu Leu
305 310 315 320

Val Lys Leu Gly Arg Asp Arg Asn Asp Thr Thr Ile Ala Val Leu Gly
325 330 335

Ala Ala Phe Lys Asn Asp Thr Gly Asp Val Arg Asn Thr Pro Val Arg
340 345 350

Gly Val Val Ala Ala Leu Arg Asp Ser Gly Phe Arg Val Arg Ile Phe
355 360 365

Asp Pro Leu Ala Asp Pro Ala Glu Ile Val Ala Arg Phe Gly Thr Ala
370 375 380

Pro Ala Ala Ser Leu Asp Glu Ala Val Ser Gly Ala Gly Cys Leu Ala
385 390 395 400

Phe Leu Ala Gly His Arg Gln Phe His Glu Leu Asp Phe Gly Ala Leu
405 410 415

Ala Glu Arg Val Asp Glu Pro Cys Leu Val Phe Asp Gly Arg Met His
420 425 430

Leu Pro Pro Ala Arg Ile Arg Glu Leu His Arg Phe Gly Phe Ala Tyr
435 440 445

Arg Gly Ile Gly Arg
450



<210> 33 
<211> 843 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1)...(843)
<400> 33

atg gag cag tcc cgg gag gcg gct gcg gcg cgg gcc atc gac tac atg 48
Met Glu Gln Ser Arg Glu Ala Ala Ala Ala Arg Ala Ile Asp Tyr Met
1 5 10 15

cgc cgg cac ctg tcg gag ccg ctg cag ctg gcc gac ctg gcc cgg gtt 96
Arg Arg His Leu Ser Glu Pro Leu Gln Leu Ala Asp Leu Ala Arg Val
20 25 30

gtc ccc ttc agc ccg ttc cac ttc cac cgg ctg ttc cgc gac gtg acc 144
Val Pro Phe Ser Pro Phe His Phe His Arg Leu Phe Arg Asp Val Thr
35 40 45

acg atg acc ccg gcc cgc ttc ctc gcc gcg ctg cgg atg gcg gag gcc 192
Thr Met Thr Pro Ala Arg Phe Leu Ala Ala Leu Arg Met Ala Glu Ala
50 55 60

cgc cgg atg ctg ctg cac tcc ggc ctg acg gtg acc gcg atc agc ggc 240
Arg Arg Met Leu Leu His Ser Gly Leu Thr Val Thr Ala Ile Ser Gly
65 70 75 80

cac gtc ggc tac acg agc gcg ggg acc ttc acc acc cag ttc tcc cgg 288
His Val Gly Tyr Thr Ser Ala Gly Thr Phe Thr Thr Gln Phe Ser Arg
85 90 95

ctg gtc ggc acg tcg ccg ggg cac ttc cgg cag atg tcc cgg ctg ctg 336
Leu Val Gly Thr Ser Pro Gly His Phe Arg Gln Met Ser Arg Leu Leu
100 105 110

gcc ggc cgg ccc tgc cac gtc ctg gcg ggc tgg ctg cgc aac gcc gtc 384
Ala Gly Arg Pro Cys His Val Leu Ala Gly Trp Leu Arg Asn Ala Val
115 120 125

acg gag gtc acc cga ccc cgg ctg gtg ctg cac gtg ccc gag agc gag 432
Thr Glu Val Thr Arg Pro Arg Leu Val Leu His Val Pro Glu Ser Glu
130 135 140

ccc ggc gac ctg gtg ctg gtc ggc ctg cgg gcc gac ggg gag gcc gcg 480
Pro Gly Asp Leu Val Leu Val Gly Leu Arg Ala Asp Gly Glu Ala Ala
145 150 155 160

gac gcg tcg acc acg tgg gcg gtg gcg gcc gac ggc gcg cag gtc ccg 528
Asp Ala Ser Thr Thr Trp Ala Val Ala Ala Asp Gly Ala Gln Val Pro
165 170 175

gtg gtg gcg cgg ccg ggc ccg tac cag gcc cgg gtc gtg ctg gtc cgg 576
Val Val Ala Arg Pro Gly Pro Tyr Gln Ala Arg Val Val Leu Val Arg
180 185 190

ggc gac agc acg ctg acc cgc gcc ctg gtg gac gag gag ccc acc agc 624
Gly Asp Ser Thr Leu Thr Arg Ala Leu Val Asp Glu Glu Pro Thr Ser
195 200 205

cat ctg gtc ggc acc gcc gaa ctg gtg ctg ccc cag gac ggc tgc gcg 672
His Leu Val Gly Thr Ala Glu Leu Val Leu Pro Gln Asp Gly Cys Ala
210 215 220

gcc gtc ccg gtc acc acc gcg ccg ccg cgg ccg acc gac ccg ccg gcg 720
Ala Val Pro Val Thr Thr Ala Pro Pro Arg Pro Thr Asp Pro Pro Ala
225 230 235 240

ctg gcc ctc ggc ccg gtg tgc cgg ctc gtc gag acg ttc acg cgg ctg 768
Leu Ala Leu Gly Pro Val Cys Arg Leu Val Glu Thr Phe Thr Arg Leu
245 250 255

gcc ggc ccg tcg ggc cgg ccg ggt ccg gcc tgg tcg gcc ggc cgc acc 816
Ala Gly Pro Ser Gly Arg Pro Gly Pro Ala Trp Ser Ala Gly Arg Thr
260 265 270

gcg ctg gcc gcg gcg gcc atc gcg tga 843
Ala Leu Ala Ala Ala Ala Ile Ala *
275 280



<210> 34 
<211> 280 
<212> PRT 
<213> Bacteria 

<400> 34 
Met Glu Gln Ser Arg Glu Ala Ala Ala Ala Arg Ala Ile Asp Tyr Met
1 5 10 15

Arg Arg His Leu Ser Glu Pro Leu Gln Leu Ala Asp Leu Ala Arg Val
20 25 30

Val Pro Phe Ser Pro Phe His Phe His Arg Leu Phe Arg Asp Val Thr
35 40 45

Thr Met Thr Pro Ala Arg Phe Leu Ala Ala Leu Arg Met Ala Glu Ala
50 55 60

Arg Arg Met Leu Leu His Ser Gly Leu Thr Val Thr Ala Ile Ser Gly
65 70 75 80

His Val Gly Tyr Thr Ser Ala Gly Thr Phe Thr Thr Gln Phe Ser Arg
85 90 95

Leu Val Gly Thr Ser Pro Gly His Phe Arg Gln Met Ser Arg Leu Leu
100 105 110

Ala Gly Arg Pro Cys His Val Leu Ala Gly Trp Leu Arg Asn Ala Val
115 120 125

Thr Glu Val Thr Arg Pro Arg Leu Val Leu His Val Pro Glu Ser Glu
130 135 140

Pro Gly Asp Leu Val Leu Val Gly Leu Arg Ala Asp Gly Glu Ala Ala
145 150 155 160

Asp Ala Ser Thr Thr Trp Ala Val Ala Ala Asp Gly Ala Gln Val Pro
165 170 175

Val Val Ala Arg Pro Gly Pro Tyr Gln Ala Arg Val Val Leu Val Arg
180 185 190

Gly Asp Ser Thr Leu Thr Arg Ala Leu Val Asp Glu Glu Pro Thr Ser
195 200 205

His Leu Val Gly Thr Ala Glu Leu Val Leu Pro Gln Asp Gly Cys Ala
210 215 220

Ala Val Pro Val Thr Thr Ala Pro Pro Arg Pro Thr Asp Pro Pro Ala
225 230 235 240

Leu Ala Leu Gly Pro Val Cys Arg Leu Val Glu Thr Phe Thr Arg Leu
245 250 255

Ala Gly Pro Ser Gly Arg Pro Gly Pro Ala Trp Ser Ala Gly Arg Thr
260 265 270

Ala Leu Ala Ala Ala Ala Ile Ala
275 280

<210> 35
<211> 1209 
<212> DNA 
<213> Bacteria 



<220> 

<221> CDS 

<222> (1)...(1209)



<400> 35 

gtg ctg gtc gat gcc gtg acc gcg ttc gat ccc acc gac gcc gac gtg 48
Val Leu Val Asp Ala Val Thr Ala Phe Asp Pro Thr Asp Ala Asp Val
1 5 10 15

cgg cgt gac ccc tac ccg tcc tac cac tgg ctg ctg cgg cac gac ccg 96
Arg Arg Asp Pro Tyr Pro Ser Tyr His Trp Leu Leu Arg His Asp Pro
20 25 30

gtg cac cgt ggc gcc cac cgg gtc tgg tac gtc tcc cgc ttc gcg gac 144
Val His Arg Gly Ala His Arg Val Trp Tyr Val Ser Arg Phe Ala Asp
35 40 45

gtg cgc gcg gtg ctc ggc gac gag cgc ttc gcc cgg acc ggc atc cgc 192
Val Arg Ala Val Leu Gly Asp Glu Arg Phe Ala Arg Thr Gly Ile Arg
50 55 60

cgg ttc tgg acc gac ctc gtc ggg ccc ggg ctg ctc gcc gag atc gtc 240
Arg Phe Trp Thr Asp Leu Val Gly Pro Gly Leu Leu Ala Glu Ile Val
65 70 75 80

ggc gac atc atc ctg ttc cag gac gag ccc gac cac ggc cgg ctg cgc 288
Gly Asp Ile Ile Leu Phe Gln Asp Glu Pro Asp His Gly Arg Leu Arg
85 90 95

ggg gtg gtc ggc ccg gcg ttc tcg ccg tcc gcg ctg cgc cgg ctg gaa 336
Gly Val Val Gly Pro Ala Phe Ser Pro Ser Ala Leu Arg Arg Leu Glu
100 105 110

ccg gtg atc gcc ggc acc gtg gac gac ctg ctg cgg ccc gcc ctg gcc 384
Pro Val Ile Ala Gly Thr Val Asp Asp Leu Leu Arg Pro Ala Leu Ala
115 120 125

cgg ggc gcg atg gac gtg gtc gac gag ctg gcg tac ccg ctg gcg ctg 432
Arg Gly Ala Met Asp Val Val Asp Glu Leu Ala Tyr Pro Leu Ala Leu
130 135 140

cgc gcg gtc ctc ggc ctg ctc ggc ctg ccc gcc gcc gac tgg ggg gcg 480
Arg Ala Val Leu Gly Leu Leu Gly Leu Pro Ala Ala Asp Trp Gly Ala
145 150 155 160

gtc ggg cgc tgg tcg cgc gac gtg gga cgg acc ctg gac cgg ggc gcc 528
Val Gly Arg Trp Ser Arg Asp Val Gly Arg Thr Leu Asp Arg Gly Ala
165 170 175

agc gcc gag gac atg cgc cgc ggc cac gcg gcg atc gcc gag ttc gcc 576
Ser Ala Glu Asp Met Arg Arg Gly His Ala Ala Ile Ala Glu Phe Ala
180 185 190

gac tac gtg gag cgg gcc ctc gcg agg cgg cgg cgt gag ggc ggc gag 624
Asp Tyr Val Glu Arg Ala Leu Ala Arg Arg Arg Arg Glu Gly Gly Glu
195 200 205

gac ctg ctg gcg ttg atg ctc gac gcc cac gac cgc ggc ctg atg agt 672
Asp Leu Leu Ala Leu Met Leu Asp Ala His Asp Arg Gly Leu Met Ser
210 215 220

cgc aac gag atc gtc agc acg gtg gtc acg ttc atc ttc acc ggc cac 720
Arg Asn Glu Ile Val Ser Thr Val Val Thr Phe Ile Phe Thr Gly His
225 230 235 240

gag acg gtg gcc agc cag gtg ggc aac gcc gtg ctg agc ctg ctg gcg 768
Glu Thr Val Ala Ser Gln Val Gly Asn Ala Val Leu Ser Leu Leu Ala
245 250 255

cac ccg gac cag ctc gac ctg ctc cgg cgc cgg ccg gac ctg ctg gcc 816
His Pro Asp Gln Leu Asp Leu Leu Arg Arg Arg Pro Asp Leu Leu Ala
260 265 270

cag gcc gtc gag gag tgc ctg cgg tac gac ccg tcg gtg cag tcc aac 864
Gln Ala Val Glu Glu Cys Leu Arg Tyr Asp Pro Ser Val Gln Ser Asn
275 280 285

acc cgg cag ctc gac gtc gac gtg gag ctg cgc ggt cgg cgg ctg cgc 912
Thr Arg Gln Leu Asp Val Asp Val Glu Leu Arg Gly Arg Arg Leu Arg
290 295 300

cgc gac gac gtc gtg gtc gtc ctg gcc ggc gcg gcg aac cgg gac ccg 960
Arg Asp Asp Val Val Val Val Leu Ala Gly Ala Ala Asn Arg Asp Pro
305 310 315 320

cgg cgg tac gac cgg ccc gac gat ttc gac atc gag cgg gat ccg gtc 1008
Arg Arg Tyr Asp Arg Pro Asp Asp Phe Asp Ile Glu Arg Asp Pro Val
325 330 335

ccg tcg atg tcc ttc ggc gcg ggg atg cgc tac tgc ctc ggg tcc tac 1056
Pro Ser Met Ser Phe Gly Ala Gly Met Arg Tyr Cys Leu Gly Ser Tyr
340 345 350

ctc gcc cgt acg cag ctg cgc gcc gcg gtg gcc gcc ctg gcc cga ctg 1104
Leu Ala Arg Thr Gln Leu Arg Ala Ala Val Ala Ala Leu Ala Arg Leu
355 360 365

ccg ggc ctg cgg ctg ggc tgc gcg tcg gac gcc ctg gcc tat cag ccg 1152
Pro Gly Leu Arg Leu Gly Cys Ala Ser Asp Ala Leu Ala Tyr Gln Pro
370 375 380

cgc acc atg ttc cgg ggc ctg gcc agc ctg ccg atc gcg ttc acg ccg 1200
Arg Thr Met Phe Arg Gly Leu Ala Ser Leu Pro Ile Ala Phe Thr Pro
385 390 395 400

ggc ggt tga 1209
Gly Gly *

<210> 36
<211> 402 
<212> PRT 
<213> Bacteria 



<400> 36 



Val Leu Val Asp Ala Val Thr Ala Phe Asp Pro Thr Asp Ala Asp Val
1 5 10 15

Arg Arg Asp Pro Tyr Pro Ser Tyr His Trp Leu Leu Arg His Asp Pro
20 25 30

Val His Arg Gly Ala His Arg Val Trp Tyr Val Ser Arg Phe Ala Asp
35 40 45

Val Arg Ala Val Leu Gly Asp Glu Arg Phe Ala Arg Thr Gly Ile Arg
50 55 60

Arg Phe Trp Thr Asp Leu Val Gly Pro Gly Leu Leu Ala Glu Ile Val
65 70 75 80

Gly Asp Ile Ile Leu Phe Gln Asp Glu Pro Asp His Gly Arg Leu Arg
85 90 95

Gly Val Val Gly Pro Ala Phe Ser Pro Ser Ala Leu Arg Arg Leu Glu
100 105 110

Pro Val Ile Ala Gly Thr Val Asp Asp Leu Leu Arg Pro Ala Leu Ala
115 120 125

Arg Gly Ala Met Asp Val Val Asp Glu Leu Ala Tyr Pro Leu Ala Leu
130 135 140

Arg Ala Val Leu Gly Leu Leu Gly Leu Pro Ala Ala Asp Trp Gly Ala
145 150 155 160

Val Gly Arg Trp Ser Arg Asp Val Gly Arg Thr Leu Asp Arg Gly Ala
165 170 175

Ser Ala Glu Asp Met Arg Arg Gly His Ala Ala Ile Ala Glu Phe Ala
180 185 190

Asp Tyr Val Glu Arg Ala Leu Ala Arg Arg Arg Arg Glu Gly Gly Glu
195 200 205

Asp Leu Leu Ala Leu Met Leu Asp Ala His Asp Arg Gly Leu Met Ser
210 215 220

Arg Asn Glu Ile Val Ser Thr Val Val Thr Phe Ile Phe Thr Gly His
225 230 235 240

Glu Thr Val Ala Ser Gln Val Gly Asn Ala Val Leu Ser Leu Leu Ala
245 250 255

His Pro Asp Gln Leu Asp Leu Leu Arg Arg Arg Pro Asp Leu Leu Ala
260 265 270

Gln Ala Val Glu Glu Cys Leu Arg Tyr Asp Pro Ser Val Gln Ser Asn
275 280 285

Thr Arg Gln Leu Asp Val Asp Val Glu Leu Arg Gly Arg Arg Leu Arg
290 295 300

Arg Asp Asp Val Val Val Val Leu Ala Gly Ala Ala Asn Arg Asp Pro
305 310 315 320

Arg Arg Tyr Asp Arg Pro Asp Asp Phe Asp Ile Glu Arg Asp Pro Val
325 330 335

Pro Ser Met Ser Phe Gly Ala Gly Met Arg Tyr Cys Leu Gly Ser Tyr
340 345 350






Leu 


Ala 


Arg 


Thr 


Gin 


Leu Arg 


Ala 


Ala 


Val 


Ala 


Ala 


Leu 


Ala 


Arg 


Leu 






355 










360 










365 








Pro 


Gly 
370 


Leu 


Arg 


Leu 


Gly 


Cys 
375 


Ala 


Ser 


Asp 


Ala 


Leu 
380 


Ala 


Tyr 


Gin 


Pro 


Arg 


Thr 


Met 


Phe 


Arg 


Gly 


Leu 


Ala 


Ser 


Leu 


Pro 


He 


Ala 


Phe 


Thr 


Pro 



385 390 395 400 

Gly Gly 
<210> 37
<211> 1263
<212> DNA
<213> Bacteria

<220>
<221> CDS
<222> (1) . . . (1263)

<400> 37

atg agc cac ccc gaa ccc gag tac gac gtg atc gtc gtg ggc ggc ggc 48
Met Ser His Pro Glu Pro Glu Tyr Asp Val Ile Val Val Gly Gly Gly
1 5 10 15

ccg gcc gga tcg agc acg gcc ggt ctg ctc gcc cag gag ggc cac cgg 96
Pro Ala Gly Ser Ser Thr Ala Gly Leu Leu Ala Gln Glu Gly His Arg
20 25 30

gtc ctg ctg ctg gag cgc gag aag ttc ccc cgc tac cac atc ggc gag 144
Val Leu Leu Leu Glu Arg Glu Lys Phe Pro Arg Tyr His Ile Gly Glu
35 40 45

tcc ctg atc agc ggc gtc acc ctc acc ctg gac gcg ctc ggc gta cgc 192
Ser Leu Ile Ser Gly Val Thr Leu Thr Leu Asp Ala Leu Gly Val Arg
50 55 60

gag cgg atg gcg gag ctg cgc ttc cag atc aaa cac ggc ggc agc ctg 240
Glu Arg Met Ala Glu Leu Arg Phe Gln Ile Lys His Gly Gly Ser Leu
65 70 75 80

ctg tgg ggg gcc gat cag acc gcc ccg tgg tcg ttc cgg ttc cgg gag 288
Leu Trp Gly Ala Asp Gln Thr Ala Pro Trp Ser Phe Arg Phe Arg Glu
85 90 95

atc cgc gac gcc cgg ttc gac tac tcg tgg cag gtc cgg cgt gcc gaa 336
Ile Arg Asp Ala Arg Phe Asp Tyr Ser Trp Gln Val Arg Arg Ala Glu
100 105 110

ttc gac gcg atg ctg ctg gac cgg gcg cgg gaa ctg ggc gtg gtg gtg 384
Phe Asp Ala Met Leu Leu Asp Arg Ala Arg Glu Leu Gly Val Val Val
115 120 125

gtc gag gga gcc acc gtc cgg ggg ccg ctg acc gac ggc gag cgg gtc 432
Val Glu Gly Ala Thr Val Arg Gly Pro Leu Thr Asp Gly Glu Arg Val
130 135 140

gcg ggc gtc agc tac cag ttc cgg ggt gag gcc gac ccg atc gac gcc 480
Ala Gly Val Ser Tyr Gln Phe Arg Gly Glu Ala Asp Pro Ile Asp Ala
145 150 155 160

cgc gcc gcg atc gtg gtc gac gcg tcg ggg cag cag cgc tgg ctc ggc 528
Arg Ala Ala Ile Val Val Asp Ala Ser Gly Gln Gln Arg Trp Leu Gly
165 170 175

cgg cac ttc ggg ttg gtc tcc tgg cac gac gac ctg cgc aac atg gcg 576
Arg His Phe Gly Leu Val Ser Trp His Asp Asp Leu Arg Asn Met Ala
180 185 190

gcg tgg agc tac tac gcc ggg gcg ctg cgc tac ccc ggc gat cac gag 624
Ala Trp Ser Tyr Tyr Ala Gly Ala Leu Arg Tyr Pro Gly Asp His Glu
195 200 205

ggc gac ctg ctc gtc gag agc tgc gcc cag ggt tgg ctc tgg tac gcg 672
Gly Asp Leu Leu Val Glu Ser Cys Ala Gln Gly Trp Leu Trp Tyr Ala
210 215 220

ccg ctg agc ccc acc ctg acc ggg atc ggg tac gtc acc ccg tcg gac 720
Pro Leu Ser Pro Thr Leu Thr Gly Ile Gly Tyr Val Thr Pro Ser Asp
225 230 235 240

cgg ttc gcc gag acc ggc ctt ccc ccg gat cag ttg ctg gag aaa cag 768
Arg Phe Ala Glu Thr Gly Leu Pro Pro Asp Gln Leu Leu Glu Lys Gln
245 250 255

atc gcg gag tcg aac gag gtc tcc tgg ctc acc gcc ggc gcg cgg cgg 816
Ile Ala Glu Ser Asn Glu Val Ser Trp Leu Thr Ala Gly Ala Arg Arg
260 265 270

gtc gac gtc tac cgc acc gcg cgg gac tgg tcg tac gcg tgc agc cag 864
Val Asp Val Tyr Arg Thr Ala Arg Asp Trp Ser Tyr Ala Cys Ser Gln
275 280 285

ttc tcc ggg ccg ggc tgg gtg ctg gtc ggt gac gcc gcc gcc ttc atc 912
Phe Ser Gly Pro Gly Trp Val Leu Val Gly Asp Ala Ala Ala Phe Ile
290 295 300

gac ccc ctg ctg tcc tcc ggc gtg acg ctg gcg atg cgc ggc gcg ctc 960
Asp Pro Leu Leu Ser Ser Gly Val Thr Leu Ala Met Arg Gly Ala Leu
305 310 315 320

agc ctg tcc cgg gcg gtg cac gag gca ctg gcc gcg ccg gag aag gag 1008
Ser Leu Ser Arg Ala Val His Glu Ala Leu Ala Ala Pro Glu Lys Glu
325 330 335

cgc cat ctc atg cag gtg tac gag gac cgc tac cgg gac ttc ctc gcc 1056
Arg His Leu Met Gln Val Tyr Glu Asp Arg Tyr Arg Asp Phe Leu Ala
340 345 350

gcc ctg ctg gat ctg atc cgg ttc ttc tac gac ggc gcg cac ggc cgc 1104
Ala Leu Leu Asp Leu Ile Arg Phe Phe Tyr Asp Gly Ala His Gly Arg
355 360 365

gac gag ttg cac ctg cgc gcc cag gcc atc gtg gac ccg gac cgg ctg 1152
Asp Glu Leu His Leu Arg Ala Gln Ala Ile Val Asp Pro Asp Arg Leu
370 375 380

atg cct ccg aag atc tcg ttc gtc tcc ctg ctg tcg ggg ctg gcg cgg 1200
Met Pro Pro Lys Ile Ser Phe Val Ser Leu Leu Ser Gly Leu Ala Arg
385 390 395 400

ggc gac gag acg ctc gac cgc agc cct cgg acg gcc att gac cga ccg 1248
Gly Asp Glu Thr Leu Asp Arg Ser Pro Arg Thr Ala Ile Asp Arg Pro
405 410 415

tca gac gct ata taa 1263
Ser Asp Ala Ile
420

<210> 38
<211> 420
<212> PRT
<213> Bacteria

<400> 38

Met Ser His Pro Glu Pro Glu Tyr Asp Val Ile Val Val Gly Gly Gly
1 5 10 15

Pro Ala Gly Ser Ser Thr Ala Gly Leu Leu Ala Gln Glu Gly His Arg
20 25 30

Val Leu Leu Leu Glu Arg Glu Lys Phe Pro Arg Tyr His Ile Gly Glu
35 40 45

Ser Leu Ile Ser Gly Val Thr Leu Thr Leu Asp Ala Leu Gly Val Arg
50 55 60

Glu Arg Met Ala Glu Leu Arg Phe Gln Ile Lys His Gly Gly Ser Leu
65 70 75 80

Leu Trp Gly Ala Asp Gln Thr Ala Pro Trp Ser Phe Arg Phe Arg Glu
85 90 95

Ile Arg Asp Ala Arg Phe Asp Tyr Ser Trp Gln Val Arg Arg Ala Glu
100 105 110

Phe Asp Ala Met Leu Leu Asp Arg Ala Arg Glu Leu Gly Val Val Val
115 120 125

Val Glu Gly Ala Thr Val Arg Gly Pro Leu Thr Asp Gly Glu Arg Val
130 135 140

Ala Gly Val Ser Tyr Gln Phe Arg Gly Glu Ala Asp Pro Ile Asp Ala
145 150 155 160

Arg Ala Ala Ile Val Val Asp Ala Ser Gly Gln Gln Arg Trp Leu Gly
165 170 175

Arg His Phe Gly Leu Val Ser Trp His Asp Asp Leu Arg Asn Met Ala
180 185 190

Ala Trp Ser Tyr Tyr Ala Gly Ala Leu Arg Tyr Pro Gly Asp His Glu
195 200 205

Gly Asp Leu Leu Val Glu Ser Cys Ala Gln Gly Trp Leu Trp Tyr Ala
210 215 220

Pro Leu Ser Pro Thr Leu Thr Gly Ile Gly Tyr Val Thr Pro Ser Asp
225 230 235 240

Arg Phe Ala Glu Thr Gly Leu Pro Pro Asp Gln Leu Leu Glu Lys Gln
245 250 255

Ile Ala Glu Ser Asn Glu Val Ser Trp Leu Thr Ala Gly Ala Arg Arg
260 265 270

Val Asp Val Tyr Arg Thr Ala Arg Asp Trp Ser Tyr Ala Cys Ser Gln
275 280 285

Phe Ser Gly Pro Gly Trp Val Leu Val Gly Asp Ala Ala Ala Phe Ile
290 295 300

Asp Pro Leu Leu Ser Ser Gly Val Thr Leu Ala Met Arg Gly Ala Leu
305 310 315 320

Ser Leu Ser Arg Ala Val His Glu Ala Leu Ala Ala Pro Glu Lys Glu
325 330 335

Arg His Leu Met Gln Val Tyr Glu Asp Arg Tyr Arg Asp Phe Leu Ala
340 345 350

Ala Leu Leu Asp Leu Ile Arg Phe Phe Tyr Asp Gly Ala His Gly Arg
355 360 365

Asp Glu Leu His Leu Arg Ala Gln Ala Ile Val Asp Pro Asp Arg Leu
370 375 380

Met Pro Pro Lys Ile Ser Phe Val Ser Leu Leu Ser Gly Leu Ala Arg
385 390 395 400

Gly Asp Glu Thr Leu Asp Arg Ser Pro Arg Thr Ala Ile Asp Arg Pro
405 410 415

Ser Asp Ala Ile
420



<210> 39
<211> 1137
<212> DNA
<213> B

<220>
<221> CDS
<222> (1) . . . (1137)

<400> 39

atg cgc gtg ctg ttc gtc tcc tcc ccc ggt atc ggc cac ctc ttc ccg 48
Met Arg Val Leu Phe Val Ser Ser Pro Gly Ile Gly His Leu Phe Pro
1 5 10 15

ctg atc cag ctc gcc tgg ggc ttc cgc acg gcc ggc cac gac gtg ctg 96
Leu Ile Gln Leu Ala Trp Gly Phe Arg Thr Ala Gly His Asp Val Leu
20 25 30

atc gcg gtc gcc gag cac gcc gac cgg gcc gcc gcc gcg ggc ctg gag 144
Ile Ala Val Ala Glu His Ala Asp Arg Ala Ala Ala Ala Gly Leu Glu
35 40 45

gtc gtc gac gtg gcg ccc gac tac agc gcg gtc aag gtc ttc gag cag 192
Val Val Asp Val Ala Pro Asp Tyr Ser Ala Val Lys Val Phe Glu Gln
50 55 60

gtg gcc aag gac aac ccg cgc ttc gcc gag acc gtc gcc acg cgt ccc 240
Val Ala Lys Asp Asn Pro Arg Phe Ala Glu Thr Val Ala Thr Arg Pro
65 70 75 80

gcg atc gat ctg gag gag tgg ggc gtg cag atc gcg gcg gtg aac cgc 288
Ala Ile Asp Leu Glu Glu Trp Gly Val Gln Ile Ala Ala Val Asn Arg
85 90 95

ccg ctg gtc gac ggg acc atg gcg ctg gtc gac gac tac cgt ccc gac 336
Pro Leu Val Asp Gly Thr Met Ala Leu Val Asp Asp Tyr Arg Pro Asp
100 105 110

ctg gtg gtc tac gag cag ggc gcc acc gtc ggc ctg ctg gcc gcc gac 384
Leu Val Val Tyr Glu Gln Gly Ala Thr Val Gly Leu Leu Ala Ala Asp
115 120 125

cgc gcc ggg gtg ccg gca gtg cag cgc aac cag agc gcc tgg cgg acc 432
Arg Ala Gly Val Pro Ala Val Gln Arg Asn Gln Ser Ala Trp Arg Thr
130 135 140

cgg ggc atg cac cgc tcg atc gcg tcc ttc ctg acc gac ctg atg gac 480
Arg Gly Met His Arg Ser Ile Ala Ser Phe Leu Thr Asp Leu Met Asp
145 150 155 160

aag cac cag gtc agc ctg ccc gag ccg gtg gcg acg atc gag tcg ttc 528
Lys His Gln Val Ser Leu Pro Glu Pro Val Ala Thr Ile Glu Ser Phe
165 170 175

ccg ccg agc ctg ctg ctg gag gcg gag ccc gag ggc tgg ttc atg cgc 576
Pro Pro Ser Leu Leu Leu Glu Ala Glu Pro Glu Gly Trp Phe Met Arg
180 185 190

tgg gtg ccg tac ggc ggc ggc gcc gtc ctc ggc gac cgg ctg ccg ccg 624
Trp Val Pro Tyr Gly Gly Gly Ala Val Leu Gly Asp Arg Leu Pro Pro
195 200 205

gtg ccc gcc cgg ccc gag gtg gcg atc acc atg ggc acc atc gag ctc 672
Val Pro Ala Arg Pro Glu Val Ala Ile Thr Met Gly Thr Ile Glu Leu
210 215 220

cag gcg ttc ggc atc ggc gcc gtg gag ccc atc atc gcc gcc gcc ggc 720
Gln Ala Phe Gly Ile Gly Ala Val Glu Pro Ile Ile Ala Ala Ala Gly
225 230 235 240

gag gtg gac gcc gac ttc gtg ctc gcc ctc ggc gac ctc gac atc agc 768
Glu Val Asp Ala Asp Phe Val Leu Ala Leu Gly Asp Leu Asp Ile Ser
245 250 255

ccg ctg ggc acg ttg ccg cgc aac gtc cgg gcg gtc ggc tgg acg ccg 816
Pro Leu Gly Thr Leu Pro Arg Asn Val Arg Ala Val Gly Trp Thr Pro
260 265 270

ctg cac acc ctg ctg cgt acc tgc acc gcg gtg gtg cac cac ggc ggg 864
Leu His Thr Leu Leu Arg Thr Cys Thr Ala Val Val His His Gly Gly
275 280 285

ggc ggc acg gtg atg acc gcc atc gac gcc ggc atc ccg cag ctg ctc 912
Gly Gly Thr Val Met Thr Ala Ile Asp Ala Gly Ile Pro Gln Leu Leu
290 295 300

gcc ccg gac ccg cgc gac cag ttc cag cac acc gcc cgg gag gcc gtc 960
Ala Pro Asp Pro Arg Asp Gln Phe Gln His Thr Ala Arg Glu Ala Val
305 310 315 320

agc cgg cgc ggc atc ggc ctg gtc agc acg tcg gac aag gtc gac gcg 1008
Ser Arg Arg Gly Ile Gly Leu Val Ser Thr Ser Asp Lys Val Asp Ala
325 330 335

gac ctg ctg cgc cgg ctg atc ggg gac gag tcg ctg cgc acc gcg gcc 1056
Asp Leu Leu Arg Arg Leu Ile Gly Asp Glu Ser Leu Arg Thr Ala Ala
340 345 350

c^ST gag gta cgc gag gag atg gtc gcg ctg ccc acg ccg gcg gag acg 1104
Arg Glu Val Arg Glu Glu Met Val Ala Leu Pro Thr Pro Ala Glu Thr
355 360 365

gtg cgg cgc atc gtc gag cgc atc tcg ggt tga 1137
Val Arg Arg Ile Val Glu Arg Ile Ser Gly *
370 375


<210> 40
<211> 378
<212> PRT
<213> Bacteria

<400> 40

Met Arg Val Leu Phe Val Ser Ser Pro Gly Ile Gly His Leu Phe Pro
1 5 10 15

Leu Ile Gln Leu Ala Trp Gly Phe Arg Thr Ala Gly His Asp Val Leu
20 25 30

Ile Ala Val Ala Glu His Ala Asp Arg Ala Ala Ala Ala Gly Leu Glu
35 40 45

Val Val Asp Val Ala Pro Asp Tyr Ser Ala Val Lys Val Phe Glu Gln
50 55 60

Val Ala Lys Asp Asn Pro Arg Phe Ala Glu Thr Val Ala Thr Arg Pro
65 70 75 80

Ala Ile Asp Leu Glu Glu Trp Gly Val Gln Ile Ala Ala Val Asn Arg
85 90 95

Pro Leu Val Asp Gly Thr Met Ala Leu Val Asp Asp Tyr Arg Pro Asp
100 105 110

Leu Val Val Tyr Glu Gln Gly Ala Thr Val Gly Leu Leu Ala Ala Asp
115 120 125

Arg Ala Gly Val Pro Ala Val Gln Arg Asn Gln Ser Ala Trp Arg Thr
130 135 140

Arg Gly Met His Arg Ser Ile Ala Ser Phe Leu Thr Asp Leu Met Asp
145 150 155 160

Lys His Gln Val Ser Leu Pro Glu Pro Val Ala Thr Ile Glu Ser Phe
165 170 175

Pro Pro Ser Leu Leu Leu Glu Ala Glu Pro Glu Gly Trp Phe Met Arg
180 185 190

Trp Val Pro Tyr Gly Gly Gly Ala Val Leu Gly Asp Arg Leu Pro Pro
195 200 205

Val Pro Ala Arg Pro Glu Val Ala Ile Thr Met Gly Thr Ile Glu Leu
210 215 220

Gln Ala Phe Gly Ile Gly Ala Val Glu Pro Ile Ile Ala Ala Ala Gly
225 230 235 240

Glu Val Asp Ala Asp Phe Val Leu Ala Leu Gly Asp Leu Asp Ile Ser
245 250 255

Pro Leu Gly Thr Leu Pro Arg Asn Val Arg Ala Val Gly Trp Thr Pro
260 265 270

Leu His Thr Leu Leu Arg Thr Cys Thr Ala Val Val His His Gly Gly
275 280 285

Gly Gly Thr Val Met Thr Ala Ile Asp Ala Gly Ile Pro Gln Leu Leu
290 295 300

Ala Pro Asp Pro Arg Asp Gln Phe Gln His Thr Ala Arg Glu Ala Val
305 310 315 320

Ser Arg Arg Gly Ile Gly Leu Val Ser Thr Ser Asp Lys Val Asp Ala
325 330 335

Asp Leu Leu Arg Arg Leu Ile Gly Asp Glu Ser Leu Arg Thr Ala Ala
340 345 350

Arg Glu Val Arg Glu Glu Met Val Ala Leu Pro Thr Pro Ala Glu Thr
355 360 365

Val Arg Arg Ile Val Glu Arg Ile Ser Gly
370 375



<210> 41
<211> 1035
<212> DNA
<213> Bacteria

<220>
<221> CDS
<222> (1) . . . (1035)

<400> 41

atg cgc acc gcc gga acg tac atc cgt ggg atc ggg gcc tac ctt cct 48
Met Arg Thr Ala Gly Thr Tyr Ile Arg Gly Ile Gly Ala Tyr Leu Pro
1 5 10 15

gag acc gtc acc gtc gag gaa gcc gtc gcc cag ggc ctg tac ccg cag 96
Glu Thr Val Thr Val Glu Glu Ala Val Ala Gln Gly Leu Tyr Pro Gln
20 25 30

gag gac atc gag acg cac ggg ctg ggc ggg gcc gcg atc gcc ggc gaa 144
Glu Asp Ile Glu Thr His Gly Leu Gly Gly Ala Ala Ile Ala Gly Glu
35 40 45

ctg ccc gcg ccg gac atg gcg ctg cgg gcc gcg cag gac gcc ctc aag 192
Leu Pro Ala Pro Asp Met Ala Leu Arg Ala Ala Gln Asp Ala Leu Lys
50 55 60

gag agc gag ctg ggc cgc ggc gac atc gac ctg ctg ctc tac gcg gcc 240
Glu Ser Glu Leu Gly Arg Gly Asp Ile Asp Leu Leu Leu Tyr Ala Ala
65 70 75 80

gcc tgg cac cag ggc ccc gag ggc tgg ctg gcg cac tcc tac atc cag 288
Ala Trp His Gln Gly Pro Glu Gly Trp Leu Ala His Ser Tyr Ile Gln
85 90 95

cac tac ctg ctc ggc ggg gtg ccc cgg gcg acc gag atc cgg cag ggc 336
His Tyr Leu Leu Gly Gly Val Pro Arg Ala Thr Glu Ile Arg Gln Gly
100 105 110

tgc aac ggc atg ttc acc atg ctc gaa ctc gcg gcc agc tac ctg aag 384
Cys Asn Gly Met Phe Thr Met Leu Glu Leu Ala Ala Ser Tyr Leu Lys
115 120 125

gcc gcg ccg gaa cgc aag gcg gcg atg ctc gtc gcc gcc gac aac tac 432
Ala Ala Pro Glu Arg Lys Ala Ala Met Leu Val Ala Ala Asp Asn Tyr
130 135 140

ggc acc ccg ctg ctg gac cgc tgg cgc acc aac ctc ggc ttc atc ctc 480
Gly Thr Pro Leu Leu Asp Arg Trp Arg Thr Asn Leu Gly Phe Ile Leu
145 150 155 160

ggc gac gcc gcc tcc gcg gtg gtg ctg agc acc gag agc ggc ttc gtc 528
Gly Asp Ala Ala Ser Ala Val Val Leu Ser Thr Glu Ser Gly Phe Val
165 170 175

gag ctg atg tcg gtc tgc tcc atc acc gtg ccg gag gcc gag gag gtg 576
Glu Leu Met Ser Val Cys Ser Ile Thr Val Pro Glu Ala Glu Glu Val
180 185 190

cac cgc ggc ggc gag ccg atg ttc ccg ccc ggc gcg acg ctc gcc aag 624
His Arg Gly Gly Glu Pro Met Phe Pro Pro Gly Ala Thr Leu Ala Lys
195 200 205

gag ctc gac ttc ggc gcc cgg ctc ttc tac cac atc acc gag cag acc 672
Glu Leu Asp Phe Gly Ala Arg Leu Phe Tyr His Ile Thr Glu Gln Thr
210 215 220

ccc gtg ctc gcc gtg ctc ggc gag gcg cag gag acg atg acc acc gtc 720
Pro Val Leu Ala Val Leu Gly Glu Ala Gln Glu Thr Met Thr Thr Val
225 230 235 240

gcc gag cag gcg ctc gcc gag gcc ggc atc ggc acg gga gac ctg gcc 768
Ala Glu Gln Ala Leu Ala Glu Ala Gly Ile Gly Thr Gly Asp Leu Ala
245 250 255

aag gtc tcc ttc atg aac tac tcc cgc gag gtg gtc gag cag cgc tgc 816
Lys Val Ser Phe Met Asn Tyr Ser Arg Glu Val Val Glu Gln Arg Cys
260 265 270

atg gcg ccc ctg ggg ctg ggg atg gag aag tcc acc tgg gac ttc ggg 864
Met Ala Pro Leu Gly Leu Gly Met Glu Lys Ser Thr Trp Asp Phe Gly
275 280 285

cgg atg atc ggg cac tgc ggc gcc agc gac cac ctg ctc gcc ctg cac 912
Arg Met Ile Gly His Cys Gly Ala Ser Asp His Leu Leu Ala Leu His
290 295 300

cac tcg ctg cgg gcc ggt gag gtc gcc gcc ggc gac cac gtg ctg tgg 960
His Ser Leu Arg Ala Gly Glu Val Ala Ala Gly Asp His Val Leu Trp
305 310 315 320

ctg gcg atg ggc ccc ggc gtg gag ttc acc gcc gcc gtc ctg cgc gta 1008
Leu Ala Met Gly Pro Gly Val Glu Phe Thr Ala Ala Val Leu Arg Val
325 330 335

ctg gac aac ccc tac gtc gag cgc tga 1035
Leu Asp Asn Pro Tyr Val Glu Arg *
340



<210> 42 
<211> 344 
<212> PRT 
<213> Bacteria 

<400> 42 
Met Arg Thr Ala Gly Thr Tyr Ile Arg Gly Ile Gly Ala Tyr Leu Pro
1 5 10 15

Glu Thr Val Thr Val Glu Glu Ala Val Ala Gln Gly Leu Tyr Pro Gln
20 25 30

Glu Asp Ile Glu Thr His Gly Leu Gly Gly Ala Ala Ile Ala Gly Glu
35 40 45

Leu Pro Ala Pro Asp Met Ala Leu Arg Ala Ala Gln Asp Ala Leu Lys
50 55 60

Glu Ser Glu Leu Gly Arg Gly Asp Ile Asp Leu Leu Leu Tyr Ala Ala
65 70 75 80

Ala Trp His Gln Gly Pro Glu Gly Trp Leu Ala His Ser Tyr Ile Gln
85 90 95

His Tyr Leu Leu Gly Gly Val Pro Arg Ala Thr Glu Ile Arg Gln Gly
100 105 110

Cys Asn Gly Met Phe Thr Met Leu Glu Leu Ala Ala Ser Tyr Leu Lys
115 120 125

Ala Ala Pro Glu Arg Lys Ala Ala Met Leu Val Ala Ala Asp Asn Tyr
130 135 140

Gly Thr Pro Leu Leu Asp Arg Trp Arg Thr Asn Leu Gly Phe Ile Leu
145 150 155 160

Gly Asp Ala Ala Ser Ala Val Val Leu Ser Thr Glu Ser Gly Phe Val
165 170 175

Glu Leu Met Ser Val Cys Ser Ile Thr Val Pro Glu Ala Glu Glu Val
180 185 190

His Arg Gly Gly Glu Pro Met Phe Pro Pro Gly Ala Thr Leu Ala Lys
195 200 205

Glu Leu Asp Phe Gly Ala Arg Leu Phe Tyr His Ile Thr Glu Gln Thr
210 215 220

Pro Val Leu Ala Val Leu Gly Glu Ala Gln Glu Thr Met Thr Thr Val
225 230 235 240

Ala Glu Gln Ala Leu Ala Glu Ala Gly Ile Gly Thr Gly Asp Leu Ala
245 250 255

Lys Val Ser Phe Met Asn Tyr Ser Arg Glu Val Val Glu Gln Arg Cys
260 265 270

Met Ala Pro Leu Gly Leu Gly Met Glu Lys Ser Thr Trp Asp Phe Gly
275 280 285

Arg Met Ile Gly His Cys Gly Ala Ser Asp His Leu Leu Ala Leu His
290 295 300

His Ser Leu Arg Ala Gly Glu Val Ala Ala Gly Asp His Val Leu Trp
305 310 315 320

Leu Ala Met Gly Pro Gly Val Glu Phe Thr Ala Ala Val Leu Arg Val
325 330 335

Leu Asp Asn Pro Tyr Val Glu Arg
340



<210> 43 

<211> 1347 

<212> DNA 

<213> . 

<220> 

<221> CDS 

<222> (1) . . . (1347) 

<400> 43 

gtg acc ggg cgc gac gac cgg ccc gac ggc gct cgg ccg gtc cca ccc 48
Val Thr Gly Arg Asp Asp Arg Pro Asp Gly Ala Arg Pro Val Pro Pro
1 5 10 15

ggg cca gcg gtc acg ccc ggg cca gcg gtc acg ccc ggg ccg ccg gtc 96
Gly Pro Ala Val Thr Pro Gly Pro Ala Val Thr Pro Gly Pro Pro Val
20 25 30

acg cca ggg cgg gcg gcg gac gga ccg gcc gag gcc ggg agc gcg gcc 144
Thr Pro Gly Arg Ala Ala Asp Gly Pro Ala Glu Ala Gly Ser Ala Ala
35 40 45

ggg atc gac gcg ttt ccg ctg ccc cgc cgg tgc ccg ttc ggc ccg ccg 192
Gly Ile Asp Ala Phe Pro Leu Pro Arg Arg Cys Pro Phe Gly Pro Pro
50 55 60

gcc gag tac gcc cgg ctg cgg acc gag cgg ccg gtc gcc cgg ctg ccc 240
Ala Glu Tyr Ala Arg Leu Arg Thr Glu Arg Pro Val Ala Arg Leu Pro
65 70 75 80

atg ctc ggc ggc aac acg gcc tgg gtg gtg tcc cgg tac gcc gac gtc 288
Met Leu Gly Gly Asn Thr Ala Trp Val Val Ser Arg Tyr Ala Asp Val
85 90 95

aag cgg gtg ctc agc gac ccg cgg atg agc gcg gac cgg cgc cgg gcc 336
Lys Arg Val Leu Ser Asp Pro Arg Met Ser Ala Asp Arg Arg Arg Ala
100 105 110

ggt ttt ccg cgg ttc gcg ccg acc acc gag agc cag cgg cag gcc tcg 384
Gly Phe Pro Arg Phe Ala Pro Thr Thr Glu Ser Gln Arg Gln Ala Ser
115 120 125

ttc gcg aac ttc cgc ccc ccg ctg aac tgg atg gac ccg ccg gag cac 432
Phe Ala Asn Phe Arg Pro Pro Leu Asn Trp Met Asp Pro Pro Glu His
130 135 140

acc gcc gcc cgc cgc cag atc gtc gac gag ttc gcc gcg cgg cgg gta 480
Thr Ala Ala Arg Arg Gln Ile Val Asp Glu Phe Ala Ala Arg Arg Val
145 150 155 160

cga cag ctg cgg ccg ctg gtc gag cgg gtg gtg gac gag cac ctc gac 528
Arg Gln Leu Arg Pro Leu Val Glu Arg Val Val Asp Glu His Leu Asp
165 170 175

gcc atg acg gcc ggg cgg tcg agc gcc gac ctg gtg ccg tcg ttc agc 576
Ala Met Thr Ala Gly Arg Ser Ser Ala Asp Leu Val Pro Ser Phe Ser
180 185 190

tat ccg gtg ccg tcg cgg gtg atc tgc gag atg ctc ggc gtg ccg tac 624
Tyr Pro Val Pro Ser Arg Val Ile Cys Glu Met Leu Gly Val Pro Tyr
195 200 205

ggc gaa cac gcg ttc ttc gag cgc cgg tcc acc cgg atg ctg agt cgc 672
Gly Glu His Ala Phe Phe Glu Arg Arg Ser Thr Arg Met Leu Ser Arg
210 215 220

ggc gtg ccc gcg gac gag cgg gcc cgg tgc gcc cgc gag atc cgc gag 720
Gly Val Pro Ala Asp Glu Arg Ala Arg Cys Ala Arg Glu Ile Arg Glu
225 230 235 240

ttc ctc gac ggc gtg gtg acc gac aag gag cgg cac ccc ggc gac gac 768
Phe Leu Asp Gly Val Val Thr Asp Lys Glu Arg His Pro Gly Asp Asp
245 250 255

gtg ctc agc cgg ctg ctc gcc gcg cag cgc gcg gcc ggc gag ccc gac 816
Val Leu Ser Arg Leu Leu Ala Ala Gln Arg Ala Ala Gly Glu Pro Asp
260 265 270

cac gag gcc gtg gtg agc atg gcc ttc gtg ctg ctg gtc gcc ggg cac 864
His Glu Ala Val Val Ser Met Ala Phe Val Leu Leu Val Ala Gly His
275 280 285

gtc acg acg tcg aac atg atc tcg ctg agc gtg ctg gcc ctg ttg acc 912
Val Thr Thr Ser Asn Met Ile Ser Leu Ser Val Leu Ala Leu Leu Thr
290 295 300

cat ccg gag cgg ctc gcc cgc ctg cgc gcc gag ccg gac cgg ttc ccc 960
His Pro Glu Arg Leu Ala Arg Leu Arg Ala Glu Pro Asp Arg Phe Pro
305 310 315 320

gcc gcc gtc gag gag ctg ctg cgg tac ttc acc atc gtc gag gcg gcg 1008
Ala Ala Val Glu Glu Leu Leu Arg Tyr Phe Thr Ile Val Glu Ala Ala
325 330 335

acc gcg cgg acc gcg acc gcc gac gtg acg gtc ggt ggg gtc acc atc 1056
Thr Ala Arg Thr Ala Thr Ala Asp Val Thr Val Gly Gly Val Thr Ile
340 345 350

cgg gcc ggg gag ggg gtg gtg gcg ctg ggc cag gcc gcc aac cgg gac 1104
Arg Ala Gly Glu Gly Val Val Ala Leu Gly Gln Ala Ala Asn Arg Asp
355 360 365

ccg gcg gcg ttc gac cgg ccg gac gag ttc gac ccg gac cgc gac gcc 1152
Pro Ala Ala Phe Asp Arg Pro Asp Glu Phe Asp Pro Asp Arg Asp Ala
370 375 380

cgg cac cac ctc gcc ttc ggc tac gga cga cac atc tgc ccc ggc cag 1200
Arg His His Leu Ala Phe Gly Tyr Gly Arg His Ile Cys Pro Gly Gln
385 390 395 400

cac ctg gcc cgg ctg gaa ctg gac gtc gcg ctg agc cgg ctg gtc cgg 1248
His Leu Ala Arg Leu Glu Leu Asp Val Ala Leu Ser Arg Leu Val Arg
405 410 415

cgg ctg ccc ggg ctg cgg ttg acc gtg gac gtg gac gac ctg ccg ctc 1296
Arg Leu Pro Gly Leu Arg Leu Thr Val Asp Val Asp Asp Leu Pro Leu
420 425 430

aag gag gac ggc aac atc ttc ggc ctg cac gcg ctg ccg gtc gcc tgg 1344
Lys Glu Asp Gly Asn Ile Phe Gly Leu His Ala Leu Pro Val Ala Trp
435 440 445

tga 1347



<210> 44 
<211> 448 
<212> PRT 
<213> Bacteria 

<400> 44 

Val Thr Gly Arg Asp Asp Arg Pro Asp Gly Ala Arg Pro Val Pro Pro
1 5 10 15

Gly Pro Ala Val Thr Pro Gly Pro Ala Val Thr Pro Gly Pro Pro Val
20 25 30

Thr Pro Gly Arg Ala Ala Asp Gly Pro Ala Glu Ala Gly Ser Ala Ala
35 40 45

Gly Ile Asp Ala Phe Pro Leu Pro Arg Arg Cys Pro Phe Gly Pro Pro
50 55 60

Ala Glu Tyr Ala Arg Leu Arg Thr Glu Arg Pro Val Ala Arg Leu Pro
65 70 75 80

Met Leu Gly Gly Asn Thr Ala Trp Val Val Ser Arg Tyr Ala Asp Val
85 90 95

Lys Arg Val Leu Ser Asp Pro Arg Met Ser Ala Asp Arg Arg Arg Ala
100 105 110

Gly Phe Pro Arg Phe Ala Pro Thr Thr Glu Ser Gln Arg Gln Ala Ser
115 120 125

Phe Ala Asn Phe Arg Pro Pro Leu Asn Trp Met Asp Pro Pro Glu His
130 135 140

Thr Ala Ala Arg Arg Gln Ile Val Asp Glu Phe Ala Ala Arg Arg Val
145 150 155 160

Arg Gln Leu Arg Pro Leu Val Glu Arg Val Val Asp Glu His Leu Asp
165 170 175

Ala Met Thr Ala Gly Arg Ser Ser Ala Asp Leu Val Pro Ser Phe Ser
180 185 190

Tyr Pro Val Pro Ser Arg Val Ile Cys Glu Met Leu Gly Val Pro Tyr
195 200 205

Gly Glu His Ala Phe Phe Glu Arg Arg Ser Thr Arg Met Leu Ser Arg
210 215 220

Gly Val Pro Ala Asp Glu Arg Ala Arg Cys Ala Arg Glu Ile Arg Glu
225 230 235 240

Phe Leu Asp Gly Val Val Thr Asp Lys Glu Arg His Pro Gly Asp Asp
245 250 255

Val Leu Ser Arg Leu Leu Ala Ala Gln Arg Ala Ala Gly Glu Pro Asp
260 265 270

His Glu Ala Val Val Ser Met Ala Phe Val Leu Leu Val Ala Gly His
275 280 285

Val Thr Thr Ser Asn Met Ile Ser Leu Ser Val Leu Ala Leu Leu Thr
290 295 300

His Pro Glu Arg Leu Ala Arg Leu Arg Ala Glu Pro Asp Arg Phe Pro
305 310 315 320

Ala Ala Val Glu Glu Leu Leu Arg Tyr Phe Thr Ile Val Glu Ala Ala
325 330 335

Thr Ala Arg Thr Ala Thr Ala Asp Val Thr Val Gly Gly Val Thr Ile
340 345 350

Arg Ala Gly Glu Gly Val Val Ala Leu Gly Gln Ala Ala Asn Arg Asp
355 360 365

Pro Ala Ala Phe Asp Arg Pro Asp Glu Phe Asp Pro Asp Arg Asp Ala
370 375 380

Arg His His Leu Ala Phe Gly Tyr Gly Arg His Ile Cys Pro Gly Gln
385 390 395 400

His Leu Ala Arg Leu Glu Leu Asp Val Ala Leu Ser Arg Leu Val Arg
405 410 415

Arg Leu Pro Gly Leu Arg Leu Thr Val Asp Val Asp Asp Leu Pro Leu
420 425 430

Lys Glu Asp Gly Asn Ile Phe Gly Leu His Ala Leu Pro Val Ala Trp
435 440 445



<210> 45 
<211> 588 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1) . . . (588) 
<400> 45

cgg ccc cac cca tgg cga ccc ggc agg tcc cgc tgc gcg agg tgc gcg 48
Arg Pro His Pro Trp Arg Pro Gly Arg Ser Arg Cys Ala Arg Cys Ala
1 5 10 15

act ggt acc gcg ccg acg acg aga tcc gcc acc gct ccg gcc ggt tct 96
Thr Gly Thr Ala Pro Thr Thr Arg Ser Ala Thr Ala Pro Ala Gly Ser
20 25 30

tcc gca tcg tcg gcc ggc ggg tgc ggg cca gca acc gcg agg tgt ccc 144
Ser Ala Ser Ser Ala Gly Gly Cys Gly Pro Ala Thr Ala Arg Cys Pro
35 40 45

agt ggt gcc agc cgt tgc tgg cac cgt gcg gga cgg gtc tgg tgg cgt 192
Ser Gly Ala Ser Arg Cys Trp His Arg Ala Gly Arg Val Trp Trp Arg
50 55 60

tcg tcg tcc gac gca tcg acg gcg tgc tgc acg tgc tcg ccc acg ccg 240
Ser Ser Ser Asp Ala Ser Thr Ala Cys Cys Thr Cys Ser Pro Thr Pro
65 70 75 80

acc tgc ggc ccg gct acc ggg aca ccg tcg agc tgg gac cga ccg tgc 288
Thr Cys Gly Pro Ala Thr Gly Thr Pro Ser Ser Trp Asp Arg Pro Cys
85 90 95

agt gca ccc cgg gac aac ttc acc ggc ccg gcc cgg gac ggc cgc ccg 336
Ser Ala Pro Arg Asp Asn Phe Thr Gly Pro Ala Arg Asp Gly Arg Pro
100 105 110

gcg tac ctc gac ctg gtg ctc tcc gac gag gtc cgc gtg cac tac gac 384
Ala Tyr Leu Asp Leu Val Leu Ser Asp Glu Val Arg Val His Tyr Asp
115 120 125

gtg ctg cag tcg gag gag ggc ggg cgg ttc cac cac gcg gtg acc cgg 432
Val Leu Gln Ser Glu Glu Gly Gly Arg Phe His His Ala Val Thr Arg
130 135 140

cac atg gtg gtg gag gtg ggc ccg gac ttc ccc acc gcg aca ccg ccg 480
His Met Val Val Glu Val Gly Pro Asp Phe Pro Thr Ala Thr Pro Pro
145 150 155 160

gac tac acc tgg ctg acc ctg cgc cag ttg acc gcc gtg gcg gcc ttc 528
Asp Tyr Thr Trp Leu Thr Leu Arg Gln Leu Thr Ala Val Ala Ala Phe
165 170 175

agc tat cag gtc aac atc gag gcg cgc agc ctc ctg ctc tgc ctg cgg 576
Ser Tyr Gln Val Asn Ile Glu Ala Arg Ser Leu Leu Leu Cys Leu Arg
180 185 190

gcg ctg cgg tga 588
Ala Leu Arg *
195



<210> 46 
<211> 195 
<212> PRT 
<213> Bacteria 
<400> 46

Arg Pro His Pro Trp Arg Pro Gly Arg Ser Arg Cys Ala Arg Cys Ala
1 5 10 15

Thr Gly Thr Ala Pro Thr Thr Arg Ser Ala Thr Ala Pro Ala Gly Ser
20 25 30

Ser Ala Ser Ser Ala Gly Gly Cys Gly Pro Ala Thr Ala Arg Cys Pro
35 40 45

Ser Gly Ala Ser Arg Cys Trp His Arg Ala Gly Arg Val Trp Trp Arg
50 55 60

Ser Ser Ser Asp Ala Ser Thr Ala Cys Cys Thr Cys Ser Pro Thr Pro
65 70 75 80

Thr Cys Gly Pro Ala Thr Gly Thr Pro Ser Ser Trp Asp Arg Pro Cys
85 90 95

Ser Ala Pro Arg Asp Asn Phe Thr Gly Pro Ala Arg Asp Gly Arg Pro
100 105 110

Ala Tyr Leu Asp Leu Val Leu Ser Asp Glu Val Arg Val His Tyr Asp
115 120 125

Val Leu Gln Ser Glu Glu Gly Gly Arg Phe His His Ala Val Thr Arg
130 135 140

His Met Val Val Glu Val Gly Pro Asp Phe Pro Thr Ala Thr Pro Pro
145 150 155 160

Asp Tyr Thr Trp Leu Thr Leu Arg Gln Leu Thr Ala Val Ala Ala Phe
165 170 175

Ser Tyr Gln Val Asn Ile Glu Ala Arg Ser Leu Leu Leu Cys Leu Arg
180 185 190

Ala Leu Arg
195



<210> 47 
<211> 591 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1) . . . (591)

<400> 47

atg acc cgg gac gat ccc gcc gac aac ccg tac cag gtg gcc gtc atc 48
Met Thr Arg Asp Asp Pro Ala Asp Asn Pro Tyr Gln Val Ala Val Ile
1 5 10 15

ggc atc ggt tgc cgg ctg ccc agc gac gtc gac acc ccg gac gcc ctc 96
Gly Ile Gly Cys Arg Leu Pro Ser Asp Val Asp Thr Pro Asp Ala Leu
20 25 30

tgg gag ctg cta ctc aag ggc ggc cag acc gcc ggc gag atc ccg gcg 144
Trp Glu Leu Leu Leu Lys Gly Gly Gln Thr Ala Gly Glu Ile Pro Ala
35 40 45

cag cgc tgg cgc gcc tac cgg gag cgc ggc ccc gag tac gag gcg gtc 192
Gln Arg Trp Arg Ala Tyr Arg Glu Arg Gly Pro Glu Tyr Glu Ala Val
50 55 60

ctg cgc gac acc gtc acc gcc ggc agc tac ctg cgt gac gtc gcg ggc 240
Leu Arg Asp Thr Val Thr Ala Gly Ser Tyr Leu Arg Asp Val Ala Gly
65 70 75 80

ttc gac ccc gag ttc ttc ggc ctg tcg ccc cgg gag gcg gcc gag atg 288
Phe Asp Pro Glu Phe Phe Gly Leu Ser Pro Arg Glu Ala Ala Glu Met
85 90 95

gac ccg cag cag cgg atc ctg ctc gag gtc ggc tgg gag gcc ctg gag 336
Asp Pro Gln Gln Arg Ile Leu Leu Glu Val Gly Trp Glu Ala Leu Glu
100 105 110

cac gcc ggc ctg cca ccc acc cgg ctg gcc ggc acc gac acg ggc gtc 384
His Ala Gly Leu Pro Pro Thr Arg Leu Ala Gly Thr Asp Thr Gly Val
115 120 125

ttc gtc ggg gac agc acc acc gac tac ggc gac cgg ctg ctg gag gac 432
Phe Val Gly Asp Ser Thr Thr Asp Tyr Gly Asp Arg Leu Leu Glu Asp
130 135 140

ctg ccg acc gtc gag gcg tac acc ggg atc ggc gcg gcc acc tgc gcc 480
Leu Pro Thr Val Glu Ala Tyr Thr Gly Ile Gly Ala Ala Thr Cys Ala
145 150 155 160

ctg gcc aac cgc atc tcc tac gcg ctg gac ctg cac ggc ccg agc gtc 528
Leu Ala Asn Arg Ile Ser Tyr Ala Leu Asp Leu His Gly Pro Ser Val
165 170 175

gcc gag gac acg gtc tgc tcg gcg tcg ctg gtc gcg gtg cac ctg gcc 576
Ala Glu Asp Thr Val Cys Ser Ala Ser Leu Val Ala Val His Leu Ala
180 185 190

tgc cag agc ctg ctg 591
Cys Gln Ser Leu Leu
195



<210> 48 
<211> 197 
<212> PRT 
<213> Bacteria 

<400> 48 

Met Thr Arg Asp Asp Pro Ala Asp Asn Pro Tyr Gln Val Ala Val Ile
1 5 10 15

Gly Ile Gly Cys Arg Leu Pro Ser Asp Val Asp Thr Pro Asp Ala Leu
20 25 30

Trp Glu Leu Leu Leu Lys Gly Gly Gln Thr Ala Gly Glu Ile Pro Ala
35 40 45

Gln Arg Trp Arg Ala Tyr Arg Glu Arg Gly Pro Glu Tyr Glu Ala Val
50 55 60

Leu Arg Asp Thr Val Thr Ala Gly Ser Tyr Leu Arg Asp Val Ala Gly
65 70 75 80

Phe Asp Pro Glu Phe Phe Gly Leu Ser Pro Arg Glu Ala Ala Glu Met
85 90 95

Asp Pro Gln Gln Arg Ile Leu Leu Glu Val Gly Trp Glu Ala Leu Glu
100 105 110

His Ala Gly Leu Pro Pro Thr Arg Leu Ala Gly Thr Asp Thr Gly Val
115 120 125

Phe Val Gly Asp Ser Thr Thr Asp Tyr Gly Asp Arg Leu Leu Glu Asp
130 135 140

Leu Pro Thr Val Glu Ala Tyr Thr Gly Ile Gly Ala Ala Thr Cys Ala
145 150 155 160

Leu Ala Asn Arg Ile Ser Tyr Ala Leu Asp Leu His Gly Pro Ser Val
165 170 175

Ala Glu Asp Thr Val Cys Ser Ala Ser Leu Val Ala Val His Leu Ala
180 185 190

Cys Gln Ser Leu Leu
195



<210> 49 
<211> 618 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1)...(618)

<400> 49 

atc ccc gag gag gcc ggg cag ctc agc atc gcg ggg gtg gcc gag ttg 48
Ile Pro Glu Glu Ala Gly Gln Leu Ser Ile Ala Gly Val Ala Glu Leu
1 5 10 15

gtg gcc cgc cgc gcc gac ccg ccc gga cac acc gag aac agc gtg ctc 96
Val Ala Arg Arg Ala Asp Pro Pro Gly His Thr Glu Asn Ser Val Leu
20 25 30

atc gcc gcg ccg ctg ccg ctg gtc tgg gac gtc acc aac gac gtg gcc 144
Ile Ala Ala Pro Leu Pro Leu Val Trp Asp Val Thr Asn Asp Val Ala
35 40 45

ggc tgg ccc gag ctg ttc acc gag tac gcc cgg gcg gag atc ctg gac 192
Gly Trp Pro Glu Leu Phe Thr Glu Tyr Ala Arg Ala Glu Ile Leu Asp
50 55 60

ggc gac ggc gac acc gtg cgg ttc cgg ctc acc atg cac ccc gac gag 240
Gly Asp Gly Asp Thr Val Arg Phe Arg Leu Thr Met His Pro Asp Glu
65 70 75 80

aac ggg gtg gcg tgg agc tgg gtc agc gaa cgc acg gcc gac ccg gcc 288
Asn Gly Val Ala Trp Ser Trp Val Ser Glu Arg Thr Ala Asp Pro Ala
85 90 95

acc cgg cag gtg cgc gcc cgg cgg gtg gag acc ggg ccg ttc gag tac 336
Thr Arg Gln Val Arg Ala Arg Arg Val Glu Thr Gly Pro Phe Glu Tyr
100 105 110

atg cgc atc cac tgg cgc tac gcg gag gag ccc ggc ggc acg cgg atg 384
Met Arg Ile His Trp Arg Tyr Ala Glu Glu Pro Gly Gly Thr Arg Met
115 120 125

acg tgg gtg cag gac ttc gcg atg aag ccg acc gcg ccg gtg gac aac 432
Thr Trp Val Gln Asp Phe Ala Met Lys Pro Thr Ala Pro Val Asp Asn
130 135 140

gcc ggc atg acc gac cgg atc aac gcc aac agc gcc gtg cag ctc gcc 480
Ala Gly Met Thr Asp Arg Ile Asn Ala Asn Ser Ala Val Gln Leu Ala
145 150 155 160

gtc atc cgg gac aag atc gaa cgc ctg gcc cgc gag gga acg gct ggc 528
Val Ile Arg Asp Lys Ile Glu Arg Leu Ala Arg Glu Gly Thr Ala Gly
165 170 175

ccg gcc ccc gcc gcc gcg gcc gcc acc acg ccc ggc ccg gcc ccg gcc 576
Pro Ala Pro Ala Ala Ala Ala Ala Thr Thr Pro Gly Pro Ala Pro Ala
180 185 190

gcg cgc acc gcc gac gag gcg acg gga gcc ggc gat gag tga 618
Ala Arg Thr Ala Asp Glu Ala Thr Gly Ala Gly Asp Glu *
195 200 205



<210> 50 

<211> 205 

<212> PRT 

<213> Bacteria 

<400> 50 
Ile Pro Glu Glu Ala Gly Gln Leu Ser Ile Ala Gly Val Ala Glu Leu
1 5 10 15

Val Ala Arg Arg Ala Asp Pro Pro Gly His Thr Glu Asn Ser Val Leu
20 25 30

Ile Ala Ala Pro Leu Pro Leu Val Trp Asp Val Thr Asn Asp Val Ala
35 40 45

Gly Trp Pro Glu Leu Phe Thr Glu Tyr Ala Arg Ala Glu Ile Leu Asp
50 55 60

Gly Asp Gly Asp Thr Val Arg Phe Arg Leu Thr Met His Pro Asp Glu
65 70 75 80

Asn Gly Val Ala Trp Ser Trp Val Ser Glu Arg Thr Ala Asp Pro Ala
85 90 95

Thr Arg Gln Val Arg Ala Arg Arg Val Glu Thr Gly Pro Phe Glu Tyr
100 105 110

Met Arg Ile His Trp Arg Tyr Ala Glu Glu Pro Gly Gly Thr Arg Met
115 120 125

Thr Trp Val Gln Asp Phe Ala Met Lys Pro Thr Ala Pro Val Asp Asn
130 135 140

Ala Gly Met Thr Asp Arg Ile Asn Ala Asn Ser Ala Val Gln Leu Ala
145 150 155 160

Val Ile Arg Asp Lys Ile Glu Arg Leu Ala Arg Glu Gly Thr Ala Gly
165 170 175

Pro Ala Pro Ala Ala Ala Ala Ala Thr Thr Pro Gly Pro Ala Pro Ala
180 185 190

Ala Arg Thr Ala Asp Glu Ala Thr Gly Ala Gly Asp Glu
195 200 205



<210> 51 
<211> 405 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1)...(405)

<400> 51

atg agt gac cag acc ctg cgg ctg gtc gcc gcc gcc gcc gtc gcg ccg 48
Met Ser Asp Gln Thr Leu Arg Leu Val Ala Ala Ala Ala Val Ala Pro
1 5 10 15

gac agc cgc cgt ggc ggc gag ctg cgg gtg ctg ctc ggc ccg aag acc 96
Asp Ser Arg Arg Gly Gly Glu Leu Arg Val Leu Leu Gly Pro Lys Thr
20 25 30

gtc ggc agc acg tcc ggc ttc atg ggg gtg gcg acg ctg cgc ccg ggg 144
Val Gly Ser Thr Ser Gly Phe Met Gly Val Ala Thr Leu Arg Pro Gly
35 40 45

gag cgg atc gcc gag cac tac cat ccc tac agc gag gag ttc ctg tac 192
Glu Arg Ile Ala Glu His Tyr His Pro Tyr Ser Glu Glu Phe Leu Tyr
50 55 60

gtc gcc cgg ggc gcg atc acc gcc gac ctg gac gac gag ccg gtg ccg 240
Val Ala Arg Gly Ala Ile Thr Ala Asp Leu Asp Asp Glu Pro Val Pro
65 70 75 80

ctg gcc gcc ggg gag gcg ctg ttc gtg ccg cgc tac gtc cgg cac cgg 288
Leu Ala Ala Gly Glu Ala Leu Phe Val Pro Arg Tyr Val Arg His Arg
85 90 95

ctg cgc aac acc ggc gac gag ccg gcc gag gtg gtc ttc cac ctc ggt 336
Leu Arg Asn Thr Gly Asp Glu Pro Ala Glu Val Val Phe His Leu Gly
100 105 110

ccc ctc gcc ccc cgg ccc gaa ctc ggc cac gtc gac acc gag ctc gtc 384
Pro Leu Ala Pro Arg Pro Glu Leu Gly His Val Asp Thr Glu Leu Val
115 120 125

gag caa cgg ggc ggg tcg tga 405
Glu Gln Arg Gly Gly Ser *
130



<210> 52 
<211> 134 
<212> PRT
<213> Bacteria 

<400> 52 

Met Ser Asp Gln Thr Leu Arg Leu Val Ala Ala Ala Ala Val Ala Pro
1 5 10 15

Asp Ser Arg Arg Gly Gly Glu Leu Arg Val Leu Leu Gly Pro Lys Thr
20 25 30

Val Gly Ser Thr Ser Gly Phe Met Gly Val Ala Thr Leu Arg Pro Gly
35 40 45

Glu Arg Ile Ala Glu His Tyr His Pro Tyr Ser Glu Glu Phe Leu Tyr
50 55 60

Val Ala Arg Gly Ala Ile Thr Ala Asp Leu Asp Asp Glu Pro Val Pro
65 70 75 80

Leu Ala Ala Gly Glu Ala Leu Phe Val Pro Arg Tyr Val Arg His Arg
85 90 95

Leu Arg Asn Thr Gly Asp Glu Pro Ala Glu Val Val Phe His Leu Gly
100 105 110

Pro Leu Ala Pro Arg Pro Glu Leu Gly His Val Asp Thr Glu Leu Val
115 120 125

Glu Gln Arg Gly Gly Ser
130



<210> 53 
<211> 1137 
<212> DNA 
<213> Bacteria 



<220> 

<221> CDS 

<222> (1)...(1137)



<400> 53 

gtg acc ggg cgc cgc acg gtg gtg acc ggc gtc ggg gtg gtc gcc ccc 48
Val Thr Gly Arg Arg Thr Val Val Thr Gly Val Gly Val Val Ala Pro
1 5 10 15

ggc ggc gcc agc cgg gac cgg ttc tgg aag gcc atc acc gag ggg cgc 96
Gly Gly Ala Ser Arg Asp Arg Phe Trp Lys Ala Ile Thr Glu Gly Arg
20 25 30

acc gcg acc cgc cgg atc acc ttc ttc gac ccg tcc gcg ttc cgg tcg 144
Thr Ala Thr Arg Arg Ile Thr Phe Phe Asp Pro Ser Ala Phe Arg Ser
35 40 45

cag atc gcc gcc gag tgc gac ttc gac ccg gtc gcc gcc ggc ctc tcc 192
Gln Ile Ala Ala Glu Cys Asp Phe Asp Pro Val Ala Ala Gly Leu Ser
50 55 60

gag gcc gag cgg cgg cgc gcc gac cgg tac gtg cag ttc gcg ctc gcc 240
Glu Ala Glu Arg Arg Arg Ala Asp Arg Tyr Val Gln Phe Ala Leu Ala
65 70 75 80

tgc tcc gcc gag gcg gtc gcc gac gcc ggg ctg gag ctc acc gac gcc 288
Cys Ser Ala Glu Ala Val Ala Asp Ala Gly Leu Glu Leu Thr Asp Ala
85 90 95

gag cgg gac cgc gcc ggg gtg gtg ctc ggc acc gcc gtc ggc ggc acc 336
Glu Arg Asp Arg Ala Gly Val Val Leu Gly Thr Ala Val Gly Gly Thr
100 105 110

atg gcc ctg gag cag gag tac gtc acg gtc agc gac acc ggc cgc cgg 384
Met Ala Leu Glu Gln Glu Tyr Val Thr Val Ser Asp Thr Gly Arg Arg
115 120 125

tgg ctg gtc gac gcc gcg cgc ggc ggc ccg tac ctc tac cag gcg ctg 432
Trp Leu Val Asp Ala Ala Arg Gly Gly Pro Tyr Leu Tyr Gln Ala Leu
130 135 140

gtg ccg agc agc ctg gcc gcc gac gtg gcc tgc cgg cac ggg ctg cac 480
Val Pro Ser Ser Leu Ala Ala Asp Val Ala Cys Arg His Gly Leu His
145 150 155 160

ggc ccc gcg cag gtg gtc tcc acc ggc tgc acc tcg ggc atc gac gcc 528
Gly Pro Ala Gln Val Val Ser Thr Gly Cys Thr Ser Gly Ile Asp Ala
165 170 175



atc tac 9 CC cac ca 9 ctc atc 9" cc 9 ac 99° 9 a 9 9" cc 9" ac atc 576 

Ile Gly Tyr Ala His Gln Leu Ile Ala Asp Gly Glu Ala Asp Ile Val
180 185 190

ctg gcc ggg gcg gcg gac tcg cct atc tcc ccg gtg acc gtc gcg tcc 624
Leu Ala Gly Ala Ala Asp Ser Pro Ile Ser Pro Val Thr Val Ala Ser
195 200 205

ttc gac gcg atc aag gcg acc agt ccc gac aac gac gat ccg gcg cac 672
Phe Asp Ala Ile Lys Ala Thr Ser Pro Asp Asn Asp Asp Pro Ala His
210 215 220

gcc tcc cgc ccg ttc gac gcc gac cgg cac ggc ttc gtc ctc gcc gag 720
Ala Ser Arg Pro Phe Asp Ala Asp Arg His Gly Phe Val Leu Ala Glu
225 230 235 240

ggc gcg gcg gtg ctg gtg ctg gag gag gcc ggg cac gcc cgg cgg cgc 768
Gly Ala Ala Val Leu Val Leu Glu Glu Ala Gly His Ala Arg Arg Arg
245 250 255

ggc gcc cac gtc tac tgc gag gtc gcc ggc tac gcc agc cgc agc aac 816
Gly Ala His Val Tyr Cys Glu Val Ala Gly Tyr Ala Ser Arg Ser Asn
260 265 270

ggc tac cac atg acg ggg ctg cgg ccc gac ggg ctg gag atg ggg ctg 864
Gly Tyr His Met Thr Gly Leu Arg Pro Asp Gly Leu Glu Met Gly Leu
275 280 285

gcc atc tcg gcc gcg ctc aag cag ggc cgg atc gcc ccc gag cag gtc 912
Ala Ile Ser Ala Ala Leu Lys Gln Gly Arg Ile Ala Pro Glu Gln Val
290 295 300

tcc tac atc agc gcg cac ggt tcc ggc acc cgg cag aac gac cgg cac 960
Ser Tyr Ile Ser Ala His Gly Ser Gly Thr Arg Gln Asn Asp Arg His
305 310 315 320

gag acc gcc gcg ttc aag cgg gcc ctc ggg cag gcc gcg tac cgg gtg 1008
Glu Thr Ala Ala Phe Lys Arg Ala Leu Gly Gln Ala Ala Tyr Arg Val
325 330 335

ccg atc agc tcg atc aag tcg atg gtc ggg cac tcg ctc ggc gcg atc 1056
Pro Ile Ser Ser Ile Lys Ser Met Val Gly His Ser Leu Gly Ala Ile
340 345 350

ggc tcg atc gag atg gcc gcc tgc gcc ctg gcc gtc gag ttc ggc gtg 1104
Gly Ser Ile Glu Met Ala Ala Cys Ala Leu Ala Val Glu Phe Gly Val
355 360 365

gtg ccg ccg acg gcc aac tgg acc acc cgg gat 1137
Val Pro Pro Thr Ala Asn Trp Thr Thr Arg Asp
370 375



<210> 54 
<211> 379 
<212> PRT 
<213> Bacteria



<400> 54 

Val Thr Gly Arg Arg Thr Val Val Thr Gly Val Gly Val Val Ala Pro
1 5 10 15

Gly Gly Ala Ser Arg Asp Arg Phe Trp Lys Ala Ile Thr Glu Gly Arg
20 25 30

Thr Ala Thr Arg Arg Ile Thr Phe Phe Asp Pro Ser Ala Phe Arg Ser
35 40 45

Gln Ile Ala Ala Glu Cys Asp Phe Asp Pro Val Ala Ala Gly Leu Ser
50 55 60

Glu Ala Glu Arg Arg Arg Ala Asp Arg Tyr Val Gln Phe Ala Leu Ala
65 70 75 80

Cys Ser Ala Glu Ala Val Ala Asp Ala Gly Leu Glu Leu Thr Asp Ala
85 90 95

Glu Arg Asp Arg Ala Gly Val Val Leu Gly Thr Ala Val Gly Gly Thr
100 105 110

Met Ala Leu Glu Gln Glu Tyr Val Thr Val Ser Asp Thr Gly Arg Arg
115 120 125

Trp Leu Val Asp Ala Ala Arg Gly Gly Pro Tyr Leu Tyr Gln Ala Leu
130 135 140

Val Pro Ser Ser Leu Ala Ala Asp Val Ala Cys Arg His Gly Leu His
145 150 155 160

Gly Pro Ala Gln Val Val Ser Thr Gly Cys Thr Ser Gly Ile Asp Ala
165 170 175

Ile Gly Tyr Ala His Gln Leu Ile Ala Asp Gly Glu Ala Asp Ile Val
180 185 190

Leu Ala Gly Ala Ala Asp Ser Pro Ile Ser Pro Val Thr Val Ala Ser
195 200 205

Phe Asp Ala Ile Lys Ala Thr Ser Pro Asp Asn Asp Asp Pro Ala His
210 215 220

Ala Ser Arg Pro Phe Asp Ala Asp Arg His Gly Phe Val Leu Ala Glu
225 230 235 240

Gly Ala Ala Val Leu Val Leu Glu Glu Ala Gly His Ala Arg Arg Arg
245 250 255

Gly Ala His Val Tyr Cys Glu Val Ala Gly Tyr Ala Ser Arg Ser Asn
260 265 270

Gly Tyr His Met Thr Gly Leu Arg Pro Asp Gly Leu Glu Met Gly Leu
275 280 285

Ala Ile Ser Ala Ala Leu Lys Gln Gly Arg Ile Ala Pro Glu Gln Val
290 295 300

Ser Tyr Ile Ser Ala His Gly Ser Gly Thr Arg Gln Asn Asp Arg His
305 310 315 320

Glu Thr Ala Ala Phe Lys Arg Ala Leu Gly Gln Ala Ala Tyr Arg Val
325 330 335

Pro Ile Ser Ser Ile Lys Ser Met Val Gly His Ser Leu Gly Ala Ile
340 345 350

Gly Ser Ile Glu Met Ala Ala Cys Ala Leu Ala Val Glu Phe Gly Val
355 360 365

Val Pro Pro Thr Ala Asn Trp Thr Thr Arg Asp
370 375



<210> 55 
<211> 969 
<212> DNA 
<213> Bacteria 
<220>
<221> CDS
<222> (1)...(969)



<400> 55 

atg ccc gcc aat tgg cga acc att cgt caa tac gcc ctg acg ccg ggg 48
Met Pro Ala Asn Trp Arg Thr Ile Arg Gln Tyr Ala Leu Thr Pro Gly
1 5 10 15

atg gcc cag acc acc ttc gcg acc cgg ggc ttc cgc gcc cgg gac gag 96
Met Ala Gln Thr Thr Phe Ala Thr Arg Gly Phe Arg Ala Arg Asp Glu
20 25 30

ccg acc cgc gag cgg ctg gag tcg gtc ggc gcc cac ttc ctc acc ggc 144
Pro Thr Arg Glu Arg Leu Glu Ser Val Gly Ala His Phe Leu Thr Gly
35 40 45

tac ggg cac gcc gtc ggc gcc cgg ggc ccg gac gag gcc gtc ggg gcg 192
Tyr Gly His Ala Val Gly Ala Arg Gly Pro Asp Glu Ala Val Gly Ala
50 55 60

ctg gag acc gtc gcg ccg gac ctg cgc ggg ttc gcg tac gag ggc gcg 240
Leu Glu Thr Val Ala Pro Asp Leu Arg Gly Phe Ala Tyr Glu Gly Ala
65 70 75 80

gcg atg ggc ctc gcc gtc ctg gac ggg ctg acc ggt ggc cgc cgg atc 288
Ala Met Gly Leu Ala Val Leu Asp Gly Leu Thr Gly Gly Arg Arg Ile
85 90 95

gcc cgg ttc ctg gcc ggg ccg gcc gcc cgg cac gtg tac atg gtc cat 336
Ala Arg Phe Leu Ala Gly Pro Ala Ala Arg His Val Tyr Met Val His
100 105 110

gtc ggg gtg ggc tgg gcg atg gcc cgc ctg ccc cgc tgg cgt cgg cac 384
Val Gly Val Gly Trp Ala Met Ala Arg Leu Pro Arg Trp Arg Arg His
115 120 125

gcg atc caa ccc gcc gac cgg ctg ctg ggc tgg ctg gcg ctg gac ggc 432
Ala Ile Gln Pro Ala Asp Arg Leu Leu Gly Trp Leu Ala Leu Asp Gly
130 135 140

tac gga ttc cac cag gcg tac ttc cac acc cgg cgg tac gtg tgg tcg 480
Tyr Gly Phe His Gln Ala Tyr Phe His Thr Arg Arg Tyr Val Trp Ser
145 150 155 160

cac cgg cgt gac gag gtg ctg ccc tgg ccc ggc gac ccg atc ggg cgg 528
His Arg Arg Asp Glu Val Leu Pro Trp Pro Gly Asp Pro Ile Gly Arg
165 170 175

tgg acc ggg cgc gtc gtg gac cag ggc atc ggc cgc gcg ctg tgg ttc 576
Trp Thr Gly Arg Val Val Asp Gln Gly Ile Gly Arg Ala Leu Trp Phe
180 185 190

gtc gag ggc gcc gac acc gac cgg atc gcc gac acc gtc gac ggc ttc 624
Val Glu Gly Ala Asp Thr Asp Arg Ile Ala Asp Thr Val Asp Gly Phe
195 200 205

ccg ccg gac cgg cac gag gac ctg tac agc ggg gtg gcg ctg gcc gcc 672
Pro Pro Asp Arg His Glu Asp Leu Tyr Ser Gly Val Ala Leu Ala Ala
210 215 220

acg tac gcc ggc ggg gcg ccg ccc gag gac ctg cgg cgg ctg cgc gag 720
Thr Tyr Ala Gly Gly Ala Pro Pro Glu Asp Leu Arg Arg Leu Arg Glu
225 230 235 240

c 9"c gga gcg tac gcc ccg gcg atg gcc cag ggc age gcc ttc gcg 768 

Arg Gly Gly Ala Tyr Ala Pro Ala Met Ala Gln Gly Ser Ala Phe Ala
245 250 255

gcg gag gcc cgg gag cgc gcc ggg ctg acc acc gcg cac acc gcg gtc 816
Ala Glu Ala Arg Glu Arg Ala Gly Leu Thr Thr Ala His Thr Ala Val
260 265 270

gcc acc gac gtc ttc tgc ggc gcg cca ccg gcc gag gcg gcg gcg gtc 864
Ala Thr Asp Val Phe Cys Gly Ala Pro Pro Ala Glu Ala Ala Ala Val
275 280 285

acc cag gcc gcg ctg gcc gac ctc gac cgg gac ggg ccg gag ccg gcc 912
Thr Gln Ala Ala Leu Ala Asp Leu Asp Arg Asp Gly Pro Glu Pro Ala
290 295 300

tac ctg gtg tgg cgg cag cgg atc gcc aag cag ttc gtg acg ctg ggg 960
Tyr Leu Val Trp Arg Gln Arg Ile Ala Lys Gln Phe Val Thr Leu Gly
305 310 315 320

agg tgc tga 969
Arg Cys *



<210> 56 
<211> 322 
<212> PRT 
<213> Bacteria 



<400> 56 
Met Pro Ala Asn Trp Arg Thr Ile Arg Gln Tyr Ala Leu Thr Pro Gly
1 5 10 15

Met Ala Gln Thr Thr Phe Ala Thr Arg Gly Phe Arg Ala Arg Asp Glu
20 25 30

Pro Thr Arg Glu Arg Leu Glu Ser Val Gly Ala His Phe Leu Thr Gly
35 40 45

Tyr Gly His Ala Val Gly Ala Arg Gly Pro Asp Glu Ala Val Gly Ala
50 55 60

Leu Glu Thr Val Ala Pro Asp Leu Arg Gly Phe Ala Tyr Glu Gly Ala
65 70 75 80

Ala Met Gly Leu Ala Val Leu Asp Gly Leu Thr Gly Gly Arg Arg Ile
85 90 95

Ala Arg Phe Leu Ala Gly Pro Ala Ala Arg His Val Tyr Met Val His
100 105 110

Val Gly Val Gly Trp Ala Met Ala Arg Leu Pro Arg Trp Arg Arg His
115 120 125

Ala Ile Gln Pro Ala Asp Arg Leu Leu Gly Trp Leu Ala Leu Asp Gly
130 135 140

Tyr Gly Phe His Gln Ala Tyr Phe His Thr Arg Arg Tyr Val Trp Ser
145 150 155 160

His Arg Arg Asp Glu Val Leu Pro Trp Pro Gly Asp Pro Ile Gly Arg
165 170 175

Trp Thr Gly Arg Val Val Asp Gln Gly Ile Gly Arg Ala Leu Trp Phe
180 185 190

Val Glu Gly Ala Asp Thr Asp Arg Ile Ala Asp Thr Val Asp Gly Phe
195 200 205

Pro Pro Asp Arg His Glu Asp Leu Tyr Ser Gly Val Ala Leu Ala Ala
210 215 220

Thr Tyr Ala Gly Gly Ala Pro Pro Glu Asp Leu Arg Arg Leu Arg Glu
225 230 235 240

Arg Gly Gly Ala Tyr Ala Pro Ala Met Ala Gln Gly Ser Ala Phe Ala
245 250 255

Ala Glu Ala Arg Glu Arg Ala Gly Leu Thr Thr Ala His Thr Ala Val
260 265 270

Ala Thr Asp Val Phe Cys Gly Ala Pro Pro Ala Glu Ala Ala Ala Val
275 280 285

Thr Gln Ala Ala Leu Ala Asp Leu Asp Arg Asp Gly Pro Glu Pro Ala
290 295 300

Tyr Leu Val Trp Arg Gln Arg Ile Ala Lys Gln Phe Val Thr Leu Gly
305 310 315 320

Arg Cys



<210> 57 

<211> 1956 

<212> DNA 

<213> Bacteria 

<220> 

<221> CDS 

<222> (1)...(1956)

<400> 57 

atg ttc cgc cgg cag ttg gcc ggg ctg gtc gcg ctg gtg ctg ctc acc 48
Met Phe Arg Arg Gln Leu Ala Gly Leu Val Ala Leu Val Leu Leu Thr
1 5 10 15

ggc atg tac gtg ctg gtc cgg cag ccg gag gcg aac gcc gac gag cgg 96
Gly Met Tyr Val Leu Val Arg Gln Pro Glu Ala Asn Ala Asp Glu Arg
20 25 30

cgc gcc atg gcg gag ccg tac cgg ttc acg ccg atg tcg ctg ccg atg 144
Arg Ala Met Ala Glu Pro Tyr Arg Phe Thr Pro Met Ser Leu Pro Met
35 40 45

cc 9" <?<? c 99 c ct 9" cc 9 ca 9 ca 9 tc 9 atc c 9" c C H9" 9 tc aac 9<3 C S^g tac 192 
Pro Gly Gly Leu Pro Gln Gln Ser Ile Arg Arg Val Asn Gly Ala Tyr
50 55 60

cag cac ctg gcg gcg tgg atc tcc tcc gtc ggc gcc ggc gcc gcg atg 240
Gln His Leu Ala Ala Trp Ile Ser Ser Val Gly Ala Gly Ala Ala Met
65 70 75 80

aac gac ctg gac ggt gac gga ctg gcc aac gac ctg tgc gtc acc gac 288
Asn Asp Leu Asp Gly Asp Gly Leu Ala Asn Asp Leu Cys Val Thr Asp
85 90 95

cca cgc gtc gac cgc gtc gtg gtg acc ccg gcc ccg acc gcc ggc gcc 336
Pro Arg Val Asp Arg Val Val Val Thr Pro Ala Pro Thr Ala Gly Ala
100 105 110

gac cgc tac cag ccg ttc gtg ctg gac ccg gcg ccg ctg ccg atg aac 384
Asp Arg Tyr Gln Pro Phe Val Leu Asp Pro Ala Pro Leu Pro Met Asn
115 120 125

ccg tac gtc gcc ccg atg ggc tgc ctg ccc ggc gac ctc aac gcc gac 432
Pro Tyr Val Ala Pro Met Gly Cys Leu Pro Gly Asp Leu Asn Ala Asp
130 135 140

ggc cgc acc gac ctg ctc gtg tac tgg tgg ggc cgg acc ccg gtg gtc 480
Gly Arg Thr Asp Leu Leu Val Tyr Trp Trp Gly Arg Thr Pro Val Val
145 150 155 160

ttc ctg gcc cgg gcg gac gcg acc ggg ctg tcc cgg gcc gcc tac cac 528
Phe Leu Ala Arg Ala Asp Ala Thr Gly Leu Ser Arg Ala Ala Tyr His
165 170 175

ccc gtc gag ctg gtg ccg ggc gcg gcg acc ggc ggt agc cgg tac gac 576
Pro Val Glu Leu Val Pro Gly Ala Ala Thr Gly Gly Ser Arg Tyr Asp
180 185 190

ggg ccg aag tgg aac acc aac gcc gcg acg ctg gcc gac ttc gac ggc 624
Gly Pro Lys Trp Asn Thr Asn Ala Ala Thr Leu Ala Asp Phe Asp Gly
195 200 205

gac ggg cac ctg gac gtc tac atc ggc aac tac ttc ccc gac agc gcc 672
Asp Gly His Leu Asp Val Tyr Ile Gly Asn Tyr Phe Pro Asp Ser Ala
210 215 220

gtc ctc gac gac acc gtc cac ggc ggg gtg gcg atg aac cgg tcc atg 720
Val Leu Asp Asp Thr Val His Gly Gly Val Ala Met Asn Arg Ser Met
225 230 235 240

tcc aac ggc ctc aac ggc ggc gag gac cac gtg ttc cgg tgg acc ggc 768
Ser Asn Gly Leu Asn Gly Gly Glu Asp His Val Phe Arg Trp Thr Gly
245 250 255

ggc acc gcc ggc gcc acg ccg agc gcc tcc ttc gcc gag gtc ccg gac 816
Gly Thr Ala Gly Ala Thr Pro Ser Ala Ser Phe Ala Glu Val Pro Asp
260 265 270

gtc ttc gac acc aag gtc tcc cgg ggc tgg acg ctc gcc gtc gcc gcg 864
Val Phe Asp Thr Lys Val Ser Arg Gly Trp Thr Leu Ala Val Ala Ala
275 280 285

aac gac ctc gac ggc gac caa ctg ccc gag ctg tac gtg gcc aac gac 912
Asn Asp Leu Asp Gly Asp Gln Leu Pro Glu Leu Tyr Val Ala Asn Asp
290 295 300

ttc ggg ccg gac cgg ctg ctg cac aac cgg tcg gag cgg ggg cgg atc 960
Phe Gly Pro Asp Arg Leu Leu His Asn Arg Ser Glu Arg Gly Arg Ile
305 310 315 320

gcc ttc gcg ccg gtc gag agc ccc ggg ctg ccc ggc ctg acc ccc aag 1008
Ala Phe Ala Pro Val Glu Ser Pro Gly Leu Pro Gly Leu Thr Pro Lys
325 330 335

tca aag cgg ctc ggc cac gac tcg ttc aag ggc atg ggc gtg gac ttc 1056
Ser Lys Arg Leu Gly His Asp Ser Phe Lys Gly Met Gly Val Asp Phe
340 345 350

ggc gac atc gac ggc gac ggc atg ttc gac ctg tac gtc ggc aac atc 1104
Gly Asp Ile Asp Gly Asp Gly Met Phe Asp Leu Tyr Val Gly Asn Ile
355 360 365

acc acc tcc ttc ggc atc cag gag agc aac ttc gcc ttc gtc aac acc 1152
Thr Thr Ser Phe Gly Ile Gln Glu Ser Asn Phe Ala Phe Val Asn Thr
370 375 380

gcc gcc gac acc gcc gcg ctg cgc gcc gcg ctg tgg gcc ggc gag gcg 1200
Ala Ala Asp Thr Ala Ala Leu Arg Ala Ala Leu Trp Ala Gly Glu Ala
385 390 395 400

ccg tgg cac gac cgc agc gcc gag ctg ggc ctg gcc tgg agc ggg tgg 1248
Pro Trp His Asp Arg Ser Ala Glu Leu Gly Leu Ala Trp Ser Gly Trp
405 410 415

agc tgg gac gtc aag ttc ggc gac ttc acc aac cgc ggc gac ccg gcg 1296
Ser Trp Asp Val Lys Phe Gly Asp Phe Thr Asn Arg Gly Asp Pro Ala
420 425 430

atc gtg cag acc tcc ggc ttc gtc aag ggc gag gtc aac cgc tgg gcg 1344
Ile Val Gln Thr Ser Gly Phe Val Lys Gly Glu Val Asn Arg Trp Ala
435 440 445

cag ttg cag gag gcg gcc acc gcc aac gac gac ctg ctc gcc aac ccc 1392
Gln Leu Gln Glu Ala Ala Thr Ala Asn Asp Asp Leu Leu Ala Asn Pro
450 455 460

cgc tgg tgg ccg aag gtc gag cag ggc gac gac atc gcc ggc ggc cag 1440
Arg Trp Trp Pro Lys Val Glu Gln Gly Asp Asp Ile Ala Gly Gly Gln
465 470 475 480

cac ctc gcc ttc cac gtc cgg ggc gcc gac ggc cgc tac gag gac ctc 1488
His Leu Ala Phe His Val Arg Gly Ala Asp Gly Arg Tyr Glu Asp Leu
485 490 495

agc cac gaa ctg ggc ctg gcc gac cgg gtg ccc agc cgg ggc atc gcc 1536
Ser His Glu Leu Gly Leu Ala Asp Arg Val Pro Ser Arg Gly Ile Ala
500 505 510

acc gcc gac gcc gac ggc gac ggg cgc ctc gac ctc gtc gtc gcc cgg 1584
Thr Ala Asp Ala Asp Gly Asp Gly Arg Leu Asp Leu Val Val Ala Arg
515 520 525

cag tgg gac gcg ccg gtc ttc tac cgc aac gac agc ccg gac acc ggt 1632
Gln Trp Asp Ala Pro Val Phe Tyr Arg Asn Asp Ser Pro Asp Thr Gly
530 535 540

tcc ttc ctc acc ctg cgg ctg ctg cac gag cag gcg ccg gcc gcc ggc 1680
Ser Phe Leu Thr Leu Arg Leu Leu His Glu Gln Ala Pro Ala Ala Gly
545 550 555 560

ccc ctc gcc ggg gcg ggg tcg ccg gtc gtc ggc gcg cag gtc cgg gtg 1728
Pro Leu Ala Gly Ala Gly Ser Pro Val Val Gly Ala Gln Val Arg Val
565 570 575

acc acg ccg gac ggc cgg gtg ctc atc gac cgg gtc gac ggc ggc agc 1776
Thr Thr Pro Asp Gly Arg Val Leu Ile Asp Arg Val Asp Gly Gly Ser
580 585 590

ggc cac tcg ggc cgg cgc agc aac gag gtg tcg ctc ggt ctc gac gac 1824
Gly His Ser Gly Arg Arg Ser Asn Glu Val Ser Leu Gly Leu Asp Asp
595 600 605

gtg acc ggc ccg gtg tcg gtc cac ctc acc tgg cgg gac cgg tcc ggc 1872
Val Thr Gly Pro Val Ser Val His Leu Thr Trp Arg Asp Arg Ser Gly
610 615 620

gcc ccg cac gag cag gag ctg acg ctg gcc ccc ggt cga cac acc ctc 1920
Ala Pro His Glu Gln Glu Leu Thr Leu Ala Pro Gly Arg His Thr Leu
625 630 635 640

acc ctc ggt tcg cag gct cgg gag gtc tcg cga tga 1956
Thr Leu Gly Ser Gln Ala Arg Glu Val Ser Arg *
645 650



<210> 58 
<211> 651 
<212> PRT 
<213> Bacteria 



<400> 58 
Met Phe Arg Arg Gln Leu Ala Gly Leu Val Ala Leu Val Leu Leu Thr
1 5 10 15

Gly Met Tyr Val Leu Val Arg Gln Pro Glu Ala Asn Ala Asp Glu Arg
20 25 30

Arg Ala Met Ala Glu Pro Tyr Arg Phe Thr Pro Met Ser Leu Pro Met
35 40 45

Pro Gly Gly Leu Pro Gln Gln Ser Ile Arg Arg Val Asn Gly Ala Tyr
50 55 60

Gln His Leu Ala Ala Trp Ile Ser Ser Val Gly Ala Gly Ala Ala Met
65 70 75 80

Asn Asp Leu Asp Gly Asp Gly Leu Ala Asn Asp Leu Cys Val Thr Asp
85 90 95

Pro Arg Val Asp Arg Val Val Val Thr Pro Ala Pro Thr Ala Gly Ala
100 105 110

Asp Arg Tyr Gln Pro Phe Val Leu Asp Pro Ala Pro Leu Pro Met Asn
115 120 125

Pro Tyr Val Ala Pro Met Gly Cys Leu Pro Gly Asp Leu Asn Ala Asp
130 135 140

Gly Arg Thr Asp Leu Leu Val Tyr Trp Trp Gly Arg Thr Pro Val Val
145 150 155 160

Phe Leu Ala Arg Ala Asp Ala Thr Gly Leu Ser Arg Ala Ala Tyr His
165 170 175

Pro Val Glu Leu Val Pro Gly Ala Ala Thr Gly Gly Ser Arg Tyr Asp
180 185 190

Gly Pro Lys Trp Asn Thr Asn Ala Ala Thr Leu Ala Asp Phe Asp Gly
195 200 205

Asp Gly His Leu Asp Val Tyr Ile Gly Asn Tyr Phe Pro Asp Ser Ala
210 215 220

Val Leu Asp Asp Thr Val His Gly Gly Val Ala Met Asn Arg Ser Met
225 230 235 240

Ser Asn Gly Leu Asn Gly Gly Glu Asp His Val Phe Arg Trp Thr Gly
245 250 255

Gly Thr Ala Gly Ala Thr Pro Ser Ala Ser Phe Ala Glu Val Pro Asp
260 265 270

Val Phe Asp Thr Lys Val Ser Arg Gly Trp Thr Leu Ala Val Ala Ala
275 280 285

Asn Asp Leu Asp Gly Asp Gln Leu Pro Glu Leu Tyr Val Ala Asn Asp
290 295 300

Phe Gly Pro Asp Arg Leu Leu His Asn Arg Ser Glu Arg Gly Arg Ile
305 310 315 320

Ala Phe Ala Pro Val Glu Ser Pro Gly Leu Pro Gly Leu Thr Pro Lys
325 330 335

Ser Lys Arg Leu Gly His Asp Ser Phe Lys Gly Met Gly Val Asp Phe
340 345 350

Gly Asp Ile Asp Gly Asp Gly Met Phe Asp Leu Tyr Val Gly Asn Ile
355 360 365

Thr Thr Ser Phe Gly Ile Gln Glu Ser Asn Phe Ala Phe Val Asn Thr
370 375 380

Ala Ala Asp Thr Ala Ala Leu Arg Ala Ala Leu Trp Ala Gly Glu Ala
385 390 395 400

Pro Trp His Asp Arg Ser Ala Glu Leu Gly Leu Ala Trp Ser Gly Trp
405 410 415

Ser Trp Asp Val Lys Phe Gly Asp Phe Thr Asn Arg Gly Asp Pro Ala
420 425 430

Ile Val Gln Thr Ser Gly Phe Val Lys Gly Glu Val Asn Arg Trp Ala
435 440 445

Gln Leu Gln Glu Ala Ala Thr Ala Asn Asp Asp Leu Leu Ala Asn Pro
450 455 460

Arg Trp Trp Pro Lys Val Glu Gln Gly Asp Asp Ile Ala Gly Gly Gln
465 470 475 480

His Leu Ala Phe His Val Arg Gly Ala Asp Gly Arg Tyr Glu Asp Leu
485 490 495

Ser His Glu Leu Gly Leu Ala Asp Arg Val Pro Ser Arg Gly Ile Ala
500 505 510

Thr Ala Asp Ala Asp Gly Asp Gly Arg Leu Asp Leu Val Val Ala Arg
515 520 525

Gln Trp Asp Ala Pro Val Phe Tyr Arg Asn Asp Ser Pro Asp Thr Gly
530 535 540

Ser Phe Leu Thr Leu Arg Leu Leu His Glu Gln Ala Pro Ala Ala Gly
545 550 555 560

Pro Leu Ala Gly Ala Gly Ser Pro Val Val Gly Ala Gln Val Arg Val
565 570 575

Thr Thr Pro Asp Gly Arg Val Leu Ile Asp Arg Val Asp Gly Gly Ser
580 585 590

Gly His Ser Gly Arg Arg Ser Asn Glu Val Ser Leu Gly Leu Asp Asp
595 600 605

Val Thr Gly Pro Val Ser Val His Leu Thr Trp Arg Asp Arg Ser Gly
610 615 620

Ala Pro His Glu Gln Glu Leu Thr Leu Ala Pro Gly Arg His Thr Leu
625 630 635 640

Thr Leu Gly Ser Gln Ala Arg Glu Val Ser Arg
645 650



<210> 59 
<211> 630
<212> DNA 
<213> Bacteria 



<220> 

<221> CDS 

<222> (1)...(630)

<400> 59 

at 9" < 3S C 9 aa ac 9" 99 a ca 9 ttg gcc gtc gtc acg gcg gac gcc gac 48 

Met Gly Glu Thr Gly Arg Gln Leu Ala Val Val Thr Ala Asp Ala Asp
1 5 10 15

gtc gtg gag gcg gag ctg gtg gac gac gag acg gcc ggc gcc tcc gtc 96
Val Val Glu Ala Glu Leu Val Asp Asp Glu Thr Ala Gly Ala Ser Val
20 25 30

gtc gtc cac acg gac cgc gac cgg cac ctc tcc ccc gag acc gtc gcc 144
Val Val His Thr Asp Arg Asp Arg His Leu Ser Pro Glu Thr Val Ala
35 40 45

gcc atc gcg gcg agc gtc gcc gac tcc acc cgc cgc gcg tac ggc acc 192
Ala Ile Ala Ala Ser Val Ala Asp Ser Thr Arg Arg Ala Tyr Gly Thr
50 55 60

gac cgg gcc gcg ttc gcc gcc tgg tgc gcc gag gag gac cgc acg gcc 240
Asp Arg Ala Ala Phe Ala Ala Trp Cys Ala Glu Glu Asp Arg Thr Ala
65 70 75 80

gtc ccc gcg tcg gcg gag acg atg gcg gag tgg gtg cgg cac ctg acc 288
Val Pro Ala Ser Ala Glu Thr Met Ala Glu Trp Val Arg His Leu Thr
85 90 95

gtc acg ccc cgc ccc cgg acg cag cga ccg gcc ggg ccg tcg acc atc 336
Val Thr Pro Arg Pro Arg Thr Gln Arg Pro Ala Gly Pro Ser Thr Ile
100 105 110

gag cgg gcc atg tcc gcc gtg acc acc tgg cac gag gag cag gga cgg 384
Glu Arg Ala Met Ser Ala Val Thr Thr Trp His Glu Glu Gln Gly Arg
115 120 125

ccg aag ccg aac atg cgc ggc gcc cgg gcc gtc ctc aac gcc tac aag 432
Pro Lys Pro Asn Met Arg Gly Ala Arg Ala Val Leu Asn Ala Tyr Lys
130 135 140

gac cgg ctc gcc gtg gag aag gcg gag gcc gcg cag gcc cgc cag gcg 480
Asp Arg Leu Ala Val Glu Lys Ala Glu Ala Ala Gln Ala Arg Gln Ala
145 150 155 160

acc gcc gcc ctc ccc ccg cag atc cgc gcc atg ctc gcc ggg gtc gac 528
Thr Ala Ala Leu Pro Pro Gln Ile Arg Ala Met Leu Ala Gly Val Asp
165 170 175

cgg acc acc ctc gcc ggg aag cgg aac gcg gcc tgg tcc tcc tcg gtt 576
Arg Thr Thr Leu Ala Gly Lys Arg Asn Ala Ala Trp Ser Ser Ser Val
180 185 190

cgc cac ggc ggc cgc gtc ctc cga gct ggt cgc agc tgg acg tcg aca 624
Arg His Gly Gly Arg Val Leu Arg Ala Gly Arg Ser Trp Thr Ser Thr
195 200 205

cgg tga 630
Arg *



<210> 60 
<211> 209 
<212> PRT 
<213> Bacteria 

<400> 60 

Met Gly Glu Thr Gly Arg Gln Leu Ala Val Val Thr Ala Asp Ala Asp
1 5 10 15

Val Val Glu Ala Glu Leu Val Asp Asp Glu Thr Ala Gly Ala Ser Val
20 25 30

Val Val His Thr Asp Arg Asp Arg His Leu Ser Pro Glu Thr Val Ala
35 40 45

Ala Ile Ala Ala Ser Val Ala Asp Ser Thr Arg Arg Ala Tyr Gly Thr
50 55 60

Asp Arg Ala Ala Phe Ala Ala Trp Cys Ala Glu Glu Asp Arg Thr Ala
65 70 75 80

Val Pro Ala Ser Ala Glu Thr Met Ala Glu Trp Val Arg His Leu Thr
85 90 95

Val Thr Pro Arg Pro Arg Thr Gln Arg Pro Ala Gly Pro Ser Thr Ile
100 105 110

Glu Arg Ala Met Ser Ala Val Thr Thr Trp His Glu Glu Gln Gly Arg
115 120 125

Pro Lys Pro Asn Met Arg Gly Ala Arg Ala Val Leu Asn Ala Tyr Lys
130 135 140

Asp Arg Leu Ala Val Glu Lys Ala Glu Ala Ala Gln Ala Arg Gln Ala
145 150 155 160

Thr Ala Ala Leu Pro Pro Gln Ile Arg Ala Met Leu Ala Gly Val Asp
165 170 175

Arg Thr Thr Leu Ala Gly Lys Arg Asn Ala Ala Trp Ser Ser Ser Val
180 185 190

Arg His Gly Gly Arg Val Leu Arg Ala Gly Arg Ser Trp Thr Ser Thr
195 200 205

Arg



<210> 61 
<211> 1566 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1)...(1566)

<400> 61 

gtg ttc ttc gag gac tgc acc ctc gcg gag gcc acc tat ccc acg ctg 48
Val Phe Phe Glu Asp Cys Thr Leu Ala Glu Ala Thr Tyr Pro Thr Leu
1 5 10 15

ttc gcc ggg gtg gac gtc gtg ccg tcc agc gtc gac ctc cag cgc gtg 96
Phe Ala Gly Val Asp Val Val Pro Ser Ser Val Asp Leu Gln Arg Val
20 25 30

gag tac gaa cgg ccc atc ggc gcg gag cag ggc ctc gcc gcc gcc ctg 144
Glu Tyr Glu Arg Pro Ile Gly Ala Glu Gln Gly Leu Ala Ala Ala Leu
35 40 45

gcg cag gag gcg gag gag gcc ggc ggc cgc tcc ccg tac gac gtg acc 192
Ala Gln Glu Ala Glu Glu Ala Gly Gly Arg Ser Pro Tyr Asp Val Thr
50 55 60

ctg atc gac gcc gcg ccg tcc ctc ggg ctg gtc acc gtt gcc gca ctc 240
Leu Ile Asp Ala Ala Pro Ser Leu Gly Leu Val Thr Val Ala Ala Leu
65 70 75 80

acc gcc gcc gac gag gcc ctg gtg ccc atc aag gtc ggc ggc ctg gac 288
Thr Ala Ala Asp Glu Ala Leu Val Pro Ile Lys Val Gly Gly Leu Asp
85 90 95

atg aag gcc atg gcg tcc ctc cac aag acg ctc cgc agc gtc cag cgg 336
Met Lys Ala Met Ala Ser Leu His Lys Thr Leu Arg Ser Val Gln Arg
100 105 110

aag acg aac ccg aag ctg agc gtc ggg gcc gtc ctg ctg acc gcg tgg 384
Lys Thr Asn Pro Lys Leu Ser Val Gly Ala Val Leu Leu Thr Ala Trp
115 120 125

gac aag agc acg ttt gcc cgg cag ctc gcc acg aag gtg agc gag gac 432
Asp Lys Ser Thr Phe Ala Arg Gln Leu Ala Thr Lys Val Ser Glu Asp
130 135 140

tac ccg gag gcg gcc gtc gtg ccg atc cgg cgc agc atc cgc gcg tcg 480
Tyr Pro Glu Ala Ala Val Val Pro Ile Arg Arg Ser Ile Arg Ala Ser
145 150 155 160

gag gcc ccg ctc tcc gag gag ccg atc cgc ctg tac gcg ccc gag gcg 528
Glu Ala Pro Leu Ser Glu Glu Pro Ile Arg Leu Tyr Ala Pro Glu Ala
165 170 175

gcc ccg gcc ggg gac tac gac cag tgc ggc cgc cgt cct cct gcc ggg 576
Ala Pro Ala Gly Asp Tyr Asp Gln Cys Gly Arg Arg Pro Pro Ala Gly
180 185 190

gag ggc tgc cgc gtg agc cgc cgc tcc ctc gcc ctc ccg tcg acc agg 624
Glu Gly Cys Arg Val Ser Arg Arg Ser Leu Ala Leu Pro Ser Thr Arg
195 200 205

agc acc gag ccg gac cac gcc gac gag ctg gag gcc gcc ccc gaa gag 672
Ser Thr Glu Pro Asp His Ala Asp Glu Leu Glu Ala Ala Pro Glu Glu
210 215 220

aag ctc gcg gcc gcg cgg tcc gcc ggg gtg gtc gcc tcg ctg acc ggc 720
Lys Leu Ala Ala Ala Arg Ser Ala Gly Val Val Ala Ser Leu Thr Gly
225 230 235 240

gcg gac ctg tcg acg ccc ctc acc gtg gcg cag ctc ccc acg ccg tac 768
Ala Asp Leu Ser Thr Pro Leu Thr Val Ala Gln Leu Pro Thr Pro Tyr
245 250 255

gac gtc gcg gag acc gtc acg gcg ccg ctg aac gac cag gag cgc ggt 816
Asp Val Ala Glu Thr Val Thr Ala Pro Leu Asn Asp Gln Glu Arg Gly
260 265 270

tac ctg gac gtg tgc gag cag gcc ctc cac ggc ttc cgg aag tcc gtc 864
Tyr Leu Asp Val Cys Glu Gln Ala Leu His Gly Phe Arg Lys Ser Val
275 280 285

gtc gtc gcg ggc aag gcc ctg gag gtc atc aac cgc ggc cgc ctc tac 912
Val Val Ala Gly Lys Ala Leu Glu Val Ile Asn Arg Gly Arg Leu Tyr
290 295 300

cgg gag acg cac gag acg ttc gcg gac tac gtg acg gag gtg tgg gac 960
Arg Glu Thr His Glu Thr Phe Ala Asp Tyr Val Thr Glu Val Trp Asp
305 310 315 320

atg aag cgg gcc cac gcc tat cgg atg atc gag ggg tgg cga ccg gcc 1008
Met Lys Arg Ala His Ala Tyr Arg Met Ile Glu Gly Trp Arg Pro Ala
325 330 335

gac ctc gtg tct cca att gga gac atc aac gag ggc cag gcc cgc gag 1056
Asp Leu Val Ser Pro Ile Gly Asp Ile Asn Glu Gly Gln Ala Arg Glu
340 345 350

ctg gcg ccc gtg ctc aag gag tac ggg ccc gag gtg acc gtc acc ctg 1104
Leu Ala Pro Val Leu Lys Glu Tyr Gly Pro Glu Val Thr Val Thr Leu
355 360 365

tac cgg ggg gtc aag gag ctg cgc ggc gac cgg cgg gtg acg gct gcg 1152
Tyr Arg Gly Val Lys Glu Leu Arg Gly Asp Arg Arg Val Thr Ala Ala
370 375 380

gac ctc tcg gag gcc cgg gca gcg ctg cct ccg ccg aag cac ctc gcc 1200
Asp Leu Ser Glu Ala Arg Ala Ala Leu Pro Pro Pro Lys His Leu Ala
385 390 395 400

cgg ccg gac cag gtg cgc gac gtc ctc acc gtg gcg gcc gcc gag ggc 1248
Arg Pro Asp Gln Val Arg Asp Val Leu Thr Val Ala Ala Ala Glu Gly
405 410 415

cgc gcg ccc cgg ctc gcc ccg gcc gag ccg aag gtg ccg gcc cag gcc 1296
Arg Ala Pro Arg Leu Ala Pro Ala Glu Pro Lys Val Pro Ala Gln Ala
420 425 430

gcc gac gag cac cag gcc gag cag gtc gac gag ggc ggc gta agt cag 1344
Ala Asp Glu His Gln Ala Glu Gln Val Asp Glu Gly Gly Val Ser Gln
435 440 445

gac cag gtc gac gag ggc gcg gag gcc atc gcc acc ctg gag gcc gcc 1392
Asp Gln Val Asp Glu Gly Ala Glu Ala Ile Ala Thr Leu Glu Ala Ala
450 455 460

gtg gcc cag caa cgg cag atc tat gac cgg gtg ggc ggc ggg act ctc 1440
Val Ala Gln Gln Arg Gln Ile Tyr Asp Arg Val Gly Gly Gly Thr Leu
465 470 475 480

gcg gcc gcc ctg ctg tac gac cca ggc cgg ggt gac cat ctg cgc cgc 1488
Ala Ala Ala Leu Leu Tyr Asp Pro Gly Arg Gly Asp His Leu Arg Arg
485 490 495

gag ctg cgg cag tac gcg cag cgg acg gcg tac cgg gca cgg gat acc 1536
Glu Leu Arg Gln Tyr Ala Gln Arg Thr Ala Tyr Arg Ala Arg Asp Thr
500 505 510

tcc ggt gag cag gtg gcc gac gac gcg taa 1566
Ser Gly Glu Gln Val Ala Asp Asp Ala *
515 520



<210> 62
<211> 521
<212> PRT
<213> Bacteria

<400> 62

Val Phe Phe Glu Asp Cys Thr Leu Ala Glu Ala Thr Tyr Pro Thr Leu
1 5 10 15

Phe Ala Gly Val Asp Val Val Pro Ser Ser Val Asp Leu Gln Arg Val
20 25 30

Glu Tyr Glu Arg Pro Ile Gly Ala Glu Gln Gly Leu Ala Ala Ala Leu
35 40 45

Ala Gln Glu Ala Glu Glu Ala Gly Gly Arg Ser Pro Tyr Asp Val Thr
50 55 60

Leu Ile Asp Ala Ala Pro Ser Leu Gly Leu Val Thr Val Ala Ala Leu
65 70 75 80

Thr Ala Ala Asp Glu Ala Leu Val Pro Ile Lys Val Gly Gly Leu Asp
85 90 95

Met Lys Ala Met Ala Ser Leu His Lys Thr Leu Arg Ser Val Gln Arg
100 105 110

Lys Thr Asn Pro Lys Leu Ser Val Gly Ala Val Leu Leu Thr Ala Trp
115 120 125

Asp Lys Ser Thr Phe Ala Arg Gln Leu Ala Thr Lys Val Ser Glu Asp
130 135 140

Tyr Pro Glu Ala Ala Val Val Pro Ile Arg Arg Ser Ile Arg Ala Ser
145 150 155 160

Glu Ala Pro Leu Ser Glu Glu Pro Ile Arg Leu Tyr Ala Pro Glu Ala
165 170 175

Ala Pro Ala Gly Asp Tyr Asp Gln Cys Gly Arg Arg Pro Pro Ala Gly
180 185 190

Glu Gly Cys Arg Val Ser Arg Arg Ser Leu Ala Leu Pro Ser Thr Arg
195 200 205

Ser Thr Glu Pro Asp His Ala Asp Glu Leu Glu Ala Ala Pro Glu Glu
210 215 220

Lys Leu Ala Ala Ala Arg Ser Ala Gly Val Val Ala Ser Leu Thr Gly
225 230 235 240

Ala Asp Leu Ser Thr Pro Leu Thr Val Ala Gln Leu Pro Thr Pro Tyr
245 250 255

Asp Val Ala Glu Thr Val Thr Ala Pro Leu Asn Asp Gln Glu Arg Gly
260 265 270

Tyr Leu Asp Val Cys Glu Gln Ala Leu His Gly Phe Arg Lys Ser Val
275 280 285

Val Val Ala Gly Lys Ala Leu Glu Val Ile Asn Arg Gly Arg Leu Tyr
290 295 300

Arg Glu Thr His Glu Thr Phe Ala Asp Tyr Val Thr Glu Val Trp Asp
305 310 315 320

Met Lys Arg Ala His Ala Tyr Arg Met Ile Glu Gly Trp Arg Pro Ala
325 330 335

Asp Leu Val Ser Pro Ile Gly Asp Ile Asn Glu Gly Gln Ala Arg Glu
340 345 350

Leu Ala Pro Val Leu Lys Glu Tyr Gly Pro Glu Val Thr Val Thr Leu
355 360 365

Tyr Arg Gly Val Lys Glu Leu Arg Gly Asp Arg Arg Val Thr Ala Ala
370 375 380

Asp Leu Ser Glu Ala Arg Ala Ala Leu Pro Pro Pro Lys His Leu Ala
385 390 395 400

Arg Pro Asp Gln Val Arg Asp Val Leu Thr Val Ala Ala Ala Glu Gly
405 410 415

Arg Ala Pro Arg Leu Ala Pro Ala Glu Pro Lys Val Pro Ala Gln Ala
420 425 430

Ala Asp Glu His Gln Ala Glu Gln Val Asp Glu Gly Gly Val Ser Gln
435 440 445

Asp Gln Val Asp Glu Gly Ala Glu Ala Ile Ala Thr Leu Glu Ala Ala
450 455 460

Val Ala Gln Gln Arg Gln Ile Tyr Asp Arg Val Gly Gly Gly Thr Leu
465 470 475 480

Ala Ala Ala Leu Leu Tyr Asp Pro Gly Arg Gly Asp His Leu Arg Arg
485 490 495

Glu Leu Arg Gln Tyr Ala Gln Arg Thr Ala Tyr Arg Ala Arg Asp Thr
500 505 510

Ser Gly Glu Gln Val Ala Asp Asp Ala
515 520



<210> 63 

<211> 528 

<212> DNA 

<213> Bacteria 

<220> 

<221> CDS 

<222> (1) . . . (528) 

<400> 63 

atg gga gag gcg cga gtg ccg acg agg aag cgc ggg ccg aac atg gcc 48
Met Gly Glu Ala Arg Val Pro Thr Arg Lys Arg Gly Pro Asn Met Ala
1 5 10 15

ctg gtc aac atg gac acc gga gag gcg gtg tcc gcc agg ccg cgg act 96
Leu Val Asn Met Asp Thr Gly Glu Ala Val Ser Ala Arg Pro Arg Thr
20 25 30

ccg cac cag ttc gac ggg aag ggg tac acc ttg cag gcc gta ggc agc 144
Pro His Gln Phe Asp Gly Lys Gly Tyr Thr Leu Gln Ala Val Gly Ser
35 40 45

gac gtc ccc ctg tac tcc ctc ggg ctg gcc gca gcg gag tgg gcg acg 192
Asp Val Pro Leu Tyr Ser Leu Gly Leu Ala Ala Ala Glu Trp Ala Thr
50 55 60

ctc gaa tgg ctc cgc gaa cac gga ggc gcg gcc gga tac gtc ccg gtc 240
Leu Glu Trp Leu Arg Glu His Gly Gly Ala Ala Gly Tyr Val Pro Val
65 70 75 80

acg ccc gag gag ctg ggc gag gac gtc ggc gcc agc aag gac acc tgc 288
Thr Pro Glu Glu Leu Gly Glu Asp Val Gly Ala Ser Lys Asp Thr Cys
85 90 95

cgg aag gcc ctt aac cgg ctg gtc aag ctc ggg ctt gtg gtc aag ccg 336
Arg Lys Ala Leu Asn Arg Leu Val Lys Leu Gly Leu Val Val Lys Pro
100 105 110

ggc ccg cga tcc ggc tct tac cag ctg aac ccc ctc cga tac tgg gag 384
Gly Pro Arg Ser Gly Ser Tyr Gln Leu Asn Pro Leu Arg Tyr Trp Glu
115 120 125

gga gcc ggg agc acg cag gtc aac gcc tgc cgc cgc atg gcg ccg ccg 432
Gly Ala Gly Ser Thr Gln Val Asn Ala Cys Arg Arg Met Ala Pro Pro
130 135 140

cgt gtg gcc ccg gac gac aag gcc atg acc agg tcc gcc agc aag ccc 480
Arg Val Ala Pro Asp Asp Lys Ala Met Thr Arg Ser Ala Ser Lys Pro
145 150 155 160

aag acc atc ccg gct acc cgc cgc cgc gcc gca gga gag acg cga tga 528
Lys Thr Ile Pro Ala Thr Arg Arg Arg Ala Ala Gly Glu Thr Arg *
165 170 175



<210> 64 
<211> 175 
<212> PRT 
<213> Bacteria 

<400> 64 

Met Gly Glu Ala Arg Val Pro Thr Arg Lys Arg Gly Pro Asn Met Ala
1 5 10 15

Leu Val Asn Met Asp Thr Gly Glu Ala Val Ser Ala Arg Pro Arg Thr
20 25 30

Pro His Gln Phe Asp Gly Lys Gly Tyr Thr Leu Gln Ala Val Gly Ser
35 40 45

Asp Val Pro Leu Tyr Ser Leu Gly Leu Ala Ala Ala Glu Trp Ala Thr
50 55 60

Leu Glu Trp Leu Arg Glu His Gly Gly Ala Ala Gly Tyr Val Pro Val
65 70 75 80

Thr Pro Glu Glu Leu Gly Glu Asp Val Gly Ala Ser Lys Asp Thr Cys
85 90 95

Arg Lys Ala Leu Asn Arg Leu Val Lys Leu Gly Leu Val Val Lys Pro
100 105 110

Gly Pro Arg Ser Gly Ser Tyr Gln Leu Asn Pro Leu Arg Tyr Trp Glu
115 120 125

Gly Ala Gly Ser Thr Gln Val Asn Ala Cys Arg Arg Met Ala Pro Pro
130 135 140

Arg Val Ala Pro Asp Asp Lys Ala Met Thr Arg Ser Ala Ser Lys Pro
145 150 155 160

Lys Thr Ile Pro Ala Thr Arg Arg Arg Ala Ala Gly Glu Thr Arg
165 170 175



<210> 65
<211> 420
<212> DNA 
<213> Bacteria 



<220> 

<221> CDS 

<222> (1) . . . (420) 

<400> 65 

atg acg acc atg ccc gta gaa ggc ttc aac ccg gag cgc gac ctg acc 48
Met Thr Thr Met Pro Val Glu Gly Phe Asn Pro Glu Arg Asp Leu Thr
1 5 10 15

gcc ccg tcg ctg tac tcg ctg aac ctg tcc gcc gct cag cac tgc acg 96
Ala Pro Ser Leu Tyr Ser Leu Asn Leu Ser Ala Ala Gln His Cys Thr
20 25 30

ctc gcg tgg gtg gag gac cac ggc ggc ctg ttt gac gtc atc ccc gta 144
Leu Ala Trp Val Glu Asp His Gly Gly Leu Phe Asp Val Ile Pro Val
35 40 45

ccg gtc gaa acc gtc gcc gag gac tgc ggc aac tcc gtc tcc acg gtg 192
Pro Val Glu Thr Val Ala Glu Asp Cys Gly Asn Ser Val Ser Thr Val
50 55 60

cac gag gct ctc gcc cgc ctg gag gcc ctg aac ctc ctc gtg cgg acc 240
His Glu Ala Leu Ala Arg Leu Glu Ala Leu Asn Leu Leu Val Arg Thr
65 70 75 80

tcc gcc ggc ctc tac cgg atc aac gcc cgg tac tac ttc acg ctg cac 288
Ser Ala Gly Leu Tyr Arg Ile Asn Ala Arg Tyr Tyr Phe Thr Leu His
85 90 95

ccc gag ctg cgc gag atg atc acc gcc gcc ctc acg gac ccc ccg gtc 336
Pro Glu Leu Arg Glu Met Ile Thr Ala Ala Leu Thr Asp Pro Pro Val
100 105 110

acc ccg gac gac cgt gcc cgc gcg ccc cgc aag gtc agc aac acc gac 384
Thr Pro Asp Asp Arg Ala Arg Ala Pro Arg Lys Val Ser Asn Thr Asp
115 120 125

gct cgc cgc cgc cgg acg atc cgc ccc gtc tct tga 420
Ala Arg Arg Arg Arg Thr Ile Arg Pro Val Ser *
130 135



<210> 66 
<211> 139 
<212> PRT 
<213> Bacteria 

<400> 66 

Met Thr Thr Met Pro Val Glu Gly Phe Asn Pro Glu Arg Asp Leu Thr
1 5 10 15

Ala Pro Ser Leu Tyr Ser Leu Asn Leu Ser Ala Ala Gln His Cys Thr
20 25 30

Leu Ala Trp Val Glu Asp His Gly Gly Leu Phe Asp Val Ile Pro Val
35 40 45

Pro Val Glu Thr Val Ala Glu Asp Cys Gly Asn Ser Val Ser Thr Val
50 55 60

His Glu Ala Leu Ala Arg Leu Glu Ala Leu Asn Leu Leu Val Arg Thr
65 70 75 80

Ser Ala Gly Leu Tyr Arg Ile Asn Ala Arg Tyr Tyr Phe Thr Leu His
85 90 95

Pro Glu Leu Arg Glu Met Ile Thr Ala Ala Leu Thr Asp Pro Pro Val
100 105 110

Thr Pro Asp Asp Arg Ala Arg Ala Pro Arg Lys Val Ser Asn Thr Asp
115 120 125

Ala Arg Arg Arg Arg Thr Ile Arg Pro Val Ser
130 135



<210> 67 
<211> 564 
<212> DNA 
<213> Bacteria 



<220> 

<221> CDS 

<222> (1) . . . (564) 



<400> 67 

gtg cca gac ggt cag ttg ccg ccc tgt acg gga gga agc tcg aag cct 48
Val Pro Asp Gly Gln Leu Pro Pro Cys Thr Gly Gly Ser Ser Lys Pro
1 5 10 15

tca ggc tgc tcg tgt cca tcg cca acg agc gtc tgc gtc acg gcc agg 96
Ser Gly Cys Ser Cys Pro Ser Pro Thr Ser Val Cys Val Thr Ala Arg
20 25 30

acg tcg ccc ttc gcg tcg tac tgg acg gtc cgg tac agc atc aca ggc 144
Thr Ser Pro Phe Ala Ser Tyr Trp Thr Val Arg Tyr Ser Ile Thr Gly
35 40 45

ttg ctc gcg gag ggc gga agg ccg aag tct tcg cac tcg aac cag gcg 192
Leu Leu Ala Glu Gly Gly Arg Pro Lys Ser Ser His Ser Asn Gln Ala
50 55 60

agc atc cga gcg gtc agg gtg cgc tcc agc cgg acg cac ccc gga atc 240
Ser Ile Arg Ala Val Arg Val Arg Ser Ser Arg Thr His Pro Gly Ile
65 70 75 80

gtg ggg gcc aca ggc gtt aga gcc agg tcc ggc gag acg cgc ggc ggc 288
Val Gly Ala Thr Gly Val Arg Ala Arg Ser Gly Glu Thr Arg Gly Gly
85 90 95

ggg atc ggg agc cca aga cct ggc gca agc ctc cgc acc gtc tca ctc 336
Gly Ile Gly Ser Pro Arg Pro Gly Ala Ser Leu Arg Thr Val Ser Leu
100 105 110

acg acg gcc acc ggg tca cca agg ctg gtc agg tcg aac cac tcg ccc 384
Thr Thr Ala Thr Gly Ser Pro Arg Leu Val Arg Ser Asn His Ser Pro
115 120 125

cgc cgg ttg tgc tca cgg aac tcc tgg tgg agc gcc ccc tca agg gcg 432
Arg Arg Leu Cys Ser Arg Asn Ser Trp Trp Ser Ala Pro Ser Arg Ala
130 135 140

cgg ccg ccc tcg cac gtc cac agc acc gac agc gtc agc ggc aga ccg 480
Arg Pro Pro Ser His Val His Ser Thr Asp Ser Val Ser Gly Arg Pro
145 150 155 160

gtc tgc atc gtg cgg act ctc ctc tcc acg tct gtc gtg gtg ccg atc 528
Val Cys Ile Val Arg Thr Leu Leu Ser Thr Ser Val Val Val Pro Ile
165 170 175

ttc acc agg tct agg ccc tcg gct cct aag aga tag 564
Phe Thr Arg Ser Arg Pro Ser Ala Pro Lys Arg *
180 185



<210> 68 
<211> 187 
<212> PRT 
<213> Bacteria 

<400> 68 

Val Pro Asp Gly Gln Leu Pro Pro Cys Thr Gly Gly Ser Ser Lys Pro
1 5 10 15

Ser Gly Cys Ser Cys Pro Ser Pro Thr Ser Val Cys Val Thr Ala Arg
20 25 30

Thr Ser Pro Phe Ala Ser Tyr Trp Thr Val Arg Tyr Ser Ile Thr Gly
35 40 45

Leu Leu Ala Glu Gly Gly Arg Pro Lys Ser Ser His Ser Asn Gln Ala
50 55 60

Ser Ile Arg Ala Val Arg Val Arg Ser Ser Arg Thr His Pro Gly Ile
65 70 75 80

Val Gly Ala Thr Gly Val Arg Ala Arg Ser Gly Glu Thr Arg Gly Gly
85 90 95

Gly Ile Gly Ser Pro Arg Pro Gly Ala Ser Leu Arg Thr Val Ser Leu
100 105 110

Thr Thr Ala Thr Gly Ser Pro Arg Leu Val Arg Ser Asn His Ser Pro
115 120 125

Arg Arg Leu Cys Ser Arg Asn Ser Trp Trp Ser Ala Pro Ser Arg Ala
130 135 140

Arg Pro Pro Ser His Val His Ser Thr Asp Ser Val Ser Gly Arg Pro
145 150 155 160

Val Cys Ile Val Arg Thr Leu Leu Ser Thr Ser Val Val Val Pro Ile
165 170 175

Phe Thr Arg Ser Arg Pro Ser Ala Pro Lys Arg
180 185



<210> 69 
<211> 798 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1) . . . (798) 

<400> 69 

atg gcg act agg cgg aag ggc cgc cct ggc ggc tat gag gaa atc gcc 48
Met Ala Thr Arg Arg Lys Gly Arg Pro Gly Gly Tyr Glu Glu Ile Ala
1 5 10 15

gcg cac ttt cgg cgg ctc atg gac tcg ggc gag ttg tcc cct ggc gac 96
Ala His Phe Arg Arg Leu Met Asp Ser Gly Glu Leu Ser Pro Gly Asp
20 25 30

ccg ctg ccc tcc atg cgc gac gtg tgc gac cag ttc ggt tcg gcg atc 144
Pro Leu Pro Ser Met Arg Asp Val Cys Asp Gln Phe Gly Ser Ala Ile
35 40 45

acg acg gtg aac cgg gcg ttc cgg ctc ctc cag gag gag ggc cgg acg 192
Thr Thr Val Asn Arg Ala Phe Arg Leu Leu Gln Glu Glu Gly Arg Thr
50 55 60

gtc tcc aag ccg ggc gtg ggc acg atc gtc cgg gac atg tcc cgg gtt 240
Val Ser Lys Pro Gly Val Gly Thr Ile Val Arg Asp Met Ser Arg Val
65 70 75 80

cgg gtg ccg ttc agt acg tac ggc gac gtc ctg gcg ccg ggc ggc gat 288
Arg Val Pro Phe Ser Thr Tyr Gly Asp Val Leu Ala Pro Gly Gly Asp
85 90 95

aag ggc ccg tgg gag cgt gcg acg gcc gcg cag ggc ctt gac ggc cgg 336
Lys Gly Pro Trp Glu Arg Ala Thr Ala Ala Gln Gly Leu Asp Gly Arg
100 105 110

atg ctc gtg gag gcg ccc gag gag gtc ggg gcc ccg gcg gac gtc gcc 384
Met Leu Val Glu Ala Pro Glu Glu Val Gly Ala Pro Ala Asp Val Ala
115 120 125

gcg cgc ctc ggc atc gag ccg ggc gcc ctg gtc gtc cac cgg cgg cgc 432
Ala Arg Leu Gly Ile Glu Pro Gly Ala Leu Val Val His Arg Arg Arg
130 135 140

cgc gcc acg atc ggc gag gac gtc gtc cag ctc caa gac gcc tgg tac 480
Arg Ala Thr Ile Gly Glu Asp Val Val Gln Leu Gln Asp Ala Trp Tyr
145 150 155 160

ccg ctg gag atc gcc cgg gcc gcc ggc ctg gac cgg ccg ggg aag gtc 528
Pro Leu Glu Ile Ala Arg Ala Ala Gly Leu Asp Arg Pro Gly Lys Val
165 170 175

gtg ggt ggt gtc ctc ggt gcc atg acg ggc gcc ggc ctt tcg ccg acg 576
Val Gly Gly Val Leu Gly Ala Met Thr Gly Ala Gly Leu Ser Pro Thr
180 185 190

tcc acc gac cac gac gtc gag gtg tgg gtg ccg tcc gcg cag caa gcc 624
Ser Thr Asp His Asp Val Glu Val Trp Val Pro Ser Ala Gln Gln Ala
195 200 205

gcg gaa ctc tcc ctc ggc tcc cgc gtg tcg gtc ctg gtc gtc gag cgc 672
Ala Glu Leu Ser Leu Gly Ser Arg Val Ser Val Leu Val Val Glu Arg
210 215 220

gtc acc tac gac gcg acg gtc cgt gtc ctg gaa ctg acc cgt cac acg 720
Val Thr Tyr Asp Ala Thr Val Arg Val Leu Glu Leu Thr Arg His Thr
225 230 235 240

ggc gcg gct gac agg ctg acg ctg acc tac aag ggc ctg cca ctc cgg 768
Gly Ala Ala Asp Arg Leu Thr Leu Thr Tyr Lys Gly Leu Pro Leu Arg
245 250 255

gcg acc gga gcc gag ggg agc acg tca tga 798
Ala Thr Gly Ala Glu Gly Ser Thr Ser *
260 265



<210> 70 
<211> 265 
<212> PRT 
<213> Bacteria 

<400> 70 

Met Ala Thr Arg Arg Lys Gly Arg Pro Gly Gly Tyr Glu Glu Ile Ala
1 5 10 15

Ala His Phe Arg Arg Leu Met Asp Ser Gly Glu Leu Ser Pro Gly Asp
20 25 30

Pro Leu Pro Ser Met Arg Asp Val Cys Asp Gln Phe Gly Ser Ala Ile
35 40 45

Thr Thr Val Asn Arg Ala Phe Arg Leu Leu Gln Glu Glu Gly Arg Thr
50 55 60

Val Ser Lys Pro Gly Val Gly Thr Ile Val Arg Asp Met Ser Arg Val
65 70 75 80

Arg Val Pro Phe Ser Thr Tyr Gly Asp Val Leu Ala Pro Gly Gly Asp
85 90 95

Lys Gly Pro Trp Glu Arg Ala Thr Ala Ala Gln Gly Leu Asp Gly Arg
100 105 110

Met Leu Val Glu Ala Pro Glu Glu Val Gly Ala Pro Ala Asp Val Ala
115 120 125

Ala Arg Leu Gly Ile Glu Pro Gly Ala Leu Val Val His Arg Arg Arg
130 135 140

Arg Ala Thr Ile Gly Glu Asp Val Val Gln Leu Gln Asp Ala Trp Tyr
145 150 155 160

Pro Leu Glu Ile Ala Arg Ala Ala Gly Leu Asp Arg Pro Gly Lys Val
165 170 175

Val Gly Gly Val Leu Gly Ala Met Thr Gly Ala Gly Leu Ser Pro Thr
180 185 190

Ser Thr Asp His Asp Val Glu Val Trp Val Pro Ser Ala Gln Gln Ala
195 200 205

Ala Glu Leu Ser Leu Gly Ser Arg Val Ser Val Leu Val Val Glu Arg
210 215 220

Val Thr Tyr Asp Ala Thr Val Arg Val Leu Glu Leu Thr Arg His Thr
225 230 235 240

Gly Ala Ala Asp Arg Leu Thr Leu Thr Tyr Lys Gly Leu Pro Leu Arg
245 250 255

Ala Thr Gly Ala Glu Gly Ser Thr Ser
260 265



<210> 71 
<211> 378 
<212> DNA 
<213> Bacteria 

<220>

<221> CDS

<222> (1) . . . (378)



<400> 71 

atg tcc acg acc acc aac gcg gtc acc tgg ttc gag gtc ggc acc gac 48
Met Ser Thr Thr Thr Asn Ala Val Thr Trp Phe Glu Val Gly Thr Asp
1 5 10 15

cgg ccg gag gag acc ggg cgc ttc tac gcc gac ctg ttc ggt tgg gcg 96
Arg Pro Glu Glu Thr Gly Arg Phe Tyr Ala Asp Leu Phe Gly Trp Ala
20 25 30

ttc ggc gag cag ggg acg ccg gag gcg tcg tac cgg gtg acg gag ccg 144
Phe Gly Glu Gln Gly Thr Pro Glu Ala Ser Tyr Arg Val Thr Glu Pro
35 40 45

ggg ccg gag ggc tcg atc cag ggc gcg atc cgg ggc acc ggc ggg gcg 192
Gly Pro Glu Gly Ser Ile Gln Gly Ala Ile Arg Gly Thr Gly Gly Ala
50 55 60

agc ccg aac tac gcc atc ttc tac gtg cag gtg gcc gac gtg gcg gac 240
Ser Pro Asn Tyr Ala Ile Phe Tyr Val Gln Val Ala Asp Val Ala Asp
65 70 75 80

gcc tgc cgg cgc gcg gag gcg gcc ggt ggc aag gtg ctg gtg ccg gcg 288
Ala Cys Arg Arg Ala Glu Ala Ala Gly Gly Lys Val Leu Val Pro Ala
85 90 95

aag tcc acc gac aac ggg ctc acc ttc gcc cac ctg ctc gac ccg gtc 336
Lys Ser Thr Asp Asn Gly Leu Thr Phe Ala His Leu Leu Asp Pro Val
100 105 110

ggc aac cac ttc ggc gtc ttc gcc ccg ccg ccg gcc gcc tga 378
Gly Asn His Phe Gly Val Phe Ala Pro Pro Pro Ala Ala *
115 120 125



<210> 72 
<211> 125 
<212> PRT 
<213> Bacteria 

<400> 72 

Met Ser Thr Thr Thr Asn Ala Val Thr Trp Phe Glu Val Gly Thr Asp
1 5 10 15

Arg Pro Glu Glu Thr Gly Arg Phe Tyr Ala Asp Leu Phe Gly Trp Ala
20 25 30

Phe Gly Glu Gln Gly Thr Pro Glu Ala Ser Tyr Arg Val Thr Glu Pro
35 40 45

Gly Pro Glu Gly Ser Ile Gln Gly Ala Ile Arg Gly Thr Gly Gly Ala
50 55 60

Ser Pro Asn Tyr Ala Ile Phe Tyr Val Gln Val Ala Asp Val Ala Asp
65 70 75 80

Ala Cys Arg Arg Ala Glu Ala Ala Gly Gly Lys Val Leu Val Pro Ala
85 90 95

Lys Ser Thr Asp Asn Gly Leu Thr Phe Ala His Leu Leu Asp Pro Val
100 105 110

Gly Asn His Phe Gly Val Phe Ala Pro Pro Pro Ala Ala
115 120 125



<210> 73 
<211> 741 
<212> DNA 
<213> Bacteria 



<220> 

<221> CDS 

<222> (1) . . . (741) 



<400> 73 

gtg cgg cga cgg cct gaa tcg tgg ggc cgg aag ccg gag ccg ccg tcc 48
Val Arg Arg Arg Pro Glu Ser Trp Gly Arg Lys Pro Glu Pro Pro Ser
1 5 10 15

gcc ccg gcg agg ttg ccg ggg cgg acg gcg tac ggt cac ttg ccg gcc 96
Ala Pro Ala Arg Leu Pro Gly Arg Thr Ala Tyr Gly His Leu Pro Ala
20 25 30

gag cct ccg cga ccg ccc ggg ccg gcc agg acg ccg gcc tcg gcg gcc 144
Glu Pro Pro Arg Pro Pro Gly Pro Ala Arg Thr Pro Ala Ser Ala Ala
35 40 45

gcg gtg atc gcg tcc gcc tgc tcc tgg gtg agc ttg ccg tcc tcg acc 192
Ala Val Ile Ala Ser Ala Cys Ser Trp Val Ser Leu Pro Ser Ser Thr
50 55 60

gcc tgc gcc agg cgc tcc ttc agc gcg gcc tgc cgg tcg gcg gag tca 240
Ala Cys Ala Arg Arg Ser Phe Ser Ala Ala Cys Arg Ser Ala Glu Ser
65 70 75 80

ccc cgc tcg ggc cgg tcg gcc ggc ttc tgc gcc tcg cgc acc ttc tcc 288
Pro Arg Ser Gly Arg Ser Ala Gly Phe Cys Ala Ser Arg Thr Phe Ser
85 90 95

agc gcg gcc gtc acc ttg tcg gtg tcg acg ccc agc tcc ttg gcc agg 336
Ser Ala Ala Val Thr Leu Ser Val Ser Thr Pro Ser Ser Leu Ala Arg
100 105 110

gcc tcg gcg aac tcc gcc tgc cgc tcg gcc cgc tgc tgc tgc cgc tcg 384
Ala Ser Ala Asn Ser Ala Cys Arg Ser Ala Arg Cys Cys Cys Arg Ser
115 120 125

tca ctg ctg ctg ccg ctc tcg ctc gcg ctg gcg ctc ggc gtc gcg gtg 432
Ser Leu Leu Leu Pro Leu Ser Leu Ala Leu Ala Leu Gly Val Ala Val
130 135 140

ccg ccg tcc gcg gcg aac gcg acc gtc ggc gcc gcg atc ccc acg ccg 480
Pro Pro Ser Ala Ala Asn Ala Thr Val Gly Ala Ala Ile Pro Thr Pro
145 150 155 160

aga acc ccg gcc gcg gcc agg ccg gcc agc agg tgc ttc ttc ttc atg 528
Arg Thr Pro Ala Ala Ala Arg Pro Ala Ser Arg Cys Phe Phe Phe Met
165 170 175

gtg ccg gac atg ctg tcc tcc gtc gga tcg gtg gtt ggt gcg atg acc 576
Val Pro Asp Met Leu Ser Ser Val Gly Ser Val Val Gly Ala Met Thr
180 185 190

tca ccc gac ggt gac cag ccc ggc tgg ggg aaa gcc gtg gtg aac ctg 624
Ser Pro Asp Gly Asp Gln Pro Gly Trp Gly Lys Ala Val Val Asn Leu
195 200 205

tca gcg agc tgg caa tcc gcc cgc cgc gcc gga caa acg ggt tgc cgg 672
Ser Ala Ser Trp Gln Ser Ala Arg Arg Ala Gly Gln Thr Gly Cys Arg
210 215 220

ggc gcc cgc cgc cgg tca ggg ttg gtg gtc ggc agg ccc cac cgg ggc 720
Gly Ala Arg Arg Arg Ser Gly Leu Val Val Gly Arg Pro His Arg Gly
225 230 235 240

gga gca gac gac gga agg tga 741
Gly Ala Asp Asp Gly Arg *
245



<210> 74 
<211> 246 
<212> PRT 
<213> Bacteria 



<400> 74 
Val Arg Arg Arg Pro Glu Ser Trp Gly Arg Lys Pro Glu Pro Pro Ser
1 5 10 15

Ala Pro Ala Arg Leu Pro Gly Arg Thr Ala Tyr Gly His Leu Pro Ala
20 25 30

Glu Pro Pro Arg Pro Pro Gly Pro Ala Arg Thr Pro Ala Ser Ala Ala
35 40 45

Ala Val Ile Ala Ser Ala Cys Ser Trp Val Ser Leu Pro Ser Ser Thr
50 55 60

Ala Cys Ala Arg Arg Ser Phe Ser Ala Ala Cys Arg Ser Ala Glu Ser
65 70 75 80

Pro Arg Ser Gly Arg Ser Ala Gly Phe Cys Ala Ser Arg Thr Phe Ser
85 90 95

Ser Ala Ala Val Thr Leu Ser Val Ser Thr Pro Ser Ser Leu Ala Arg
100 105 110

Ala Ser Ala Asn Ser Ala Cys Arg Ser Ala Arg Cys Cys Cys Arg Ser
115 120 125

Ser Leu Leu Leu Pro Leu Ser Leu Ala Leu Ala Leu Gly Val Ala Val
130 135 140

Pro Pro Ser Ala Ala Asn Ala Thr Val Gly Ala Ala Ile Pro Thr Pro
145 150 155 160

Arg Thr Pro Ala Ala Ala Arg Pro Ala Ser Arg Cys Phe Phe Phe Met
165 170 175

Val Pro Asp Met Leu Ser Ser Val Gly Ser Val Val Gly Ala Met Thr
180 185 190

Ser Pro Asp Gly Asp Gln Pro Gly Trp Gly Lys Ala Val Val Asn Leu
195 200 205

Ser Ala Ser Trp Gln Ser Ala Arg Arg Ala Gly Gln Thr Gly Cys Arg
210 215 220

Gly Ala Arg Arg Arg Ser Gly Leu Val Val Gly Arg Pro His Arg Gly
225 230 235 240

Gly Ala Asp Asp Gly Arg
245



<210> 75 
<211> 891 
<212> DNA 
<213> Bacteria 



<220> 

<221> CDS 

<222> (1) . . . (891) 



<400> 75 

gtg atc ctc gtg gcg ttg gcg aag atc cgg gaa gtc ccg ctc acc ggg 48
Val Ile Leu Val Ala Leu Ala Lys Ile Arg Glu Val Pro Leu Thr Gly
1 5 10 15

gcg gac gcc ggc ccg tac ggc gtc acc gtc ggc ccc gac ggc gcg ctc 96
Ala Asp Ala Gly Pro Tyr Gly Val Thr Val Gly Pro Asp Gly Ala Leu
20 25 30

tgg ctg acg ctg gtc cac gcc ggc gcg gtc gcc cgg gtg ggc gcg gac 144
Trp Leu Thr Leu Val His Ala Gly Ala Val Ala Arg Val Gly Ala Asp
35 40 45

ggc gac ctg cgc acc tgg cag gtg gcg gcc gac agc cgg ccg ctg atc 192
Gly Asp Leu Arg Thr Trp Gln Val Ala Ala Asp Ser Arg Pro Leu Ile
50 55 60

gtc acg ccg ggc ccc gac ggc gcc ctc tgg ttc acc cgc tcc ggc gac 240
Val Thr Pro Gly Pro Asp Gly Ala Leu Trp Phe Thr Arg Ser Gly Asp
65 70 75 80

gac cgg atc ggc cgg atc acc acc gac ggg gag cag agc gcc gtc gcg 288
Asp Arg Ile Gly Arg Ile Thr Thr Asp Gly Glu Gln Ser Ala Val Ala
85 90 95

ctc ccg ccc ggg agc ggc ccc tgc ggc atc gcc gcc ggt ccc gac ggc 336
Leu Pro Pro Gly Ser Gly Pro Cys Gly Ile Ala Ala Gly Pro Asp Gly
100 105 110

gcc ctc tgg tac gcg gcg atg acc gcc gac gcg gtc ggc cgc gtc acc 384
Ala Leu Trp Tyr Ala Ala Met Thr Ala Asp Ala Val Gly Arg Val Thr
115 120 125

acc gac ggg aag gtg acg cag ttt ccg ctg ccg gtg agc ggc ggc ttc 432
Thr Asp Gly Lys Val Thr Gln Phe Pro Leu Pro Val Ser Gly Gly Phe
130 135 140

gcc tcg atg gtc gcc gcc ggc ccg gac gag gcc gtc tgg ttc acg ctc 480
Ala Ser Met Val Ala Ala Gly Pro Asp Glu Ala Val Trp Phe Thr Leu
145 150 155 160

aac cag gcg aac gcg gtc ggc cgg atc ggc acg gac ggc gcg gtg gcg 528
Asn Gln Ala Asn Ala Val Gly Arg Ile Gly Thr Asp Gly Ala Val Ala
165 170 175

ctg cac cca ctg ccg acc gag ggc gcc gcc ccg gtg ggc atc acg gcc 576
Leu His Pro Leu Pro Thr Glu Gly Ala Ala Pro Val Gly Ile Thr Ala
180 185 190

gga gcg gac ggc gcg ctc tgg ttc gtc gag atc ggc gcc ggc cag ctc 624
Gly Ala Asp Gly Ala Leu Trp Phe Val Glu Ile Gly Ala Gly Gln Leu
195 200 205

ggc cgg atc acc ccg gac ggg cgg atc gac gag tac ccg ctg ccg gac 672
Gly Arg Ile Thr Pro Asp Gly Arg Ile Asp Glu Tyr Pro Leu Pro Asp
210 215 220

cgg gcg gcc cgg ccg cac gcg atc gtc gcc gac ccg gcg ggc ggc tgc 720
Arg Ala Ala Arg Pro His Ala Ile Val Ala Asp Pro Ala Gly Gly Cys
225 230 235 240

tgg ttc acc gag tgg ggc ggc aac cgg atc ggc cac gtc gcc ccg gac 768
Trp Phe Thr Glu Trp Gly Gly Asn Arg Ile Gly His Val Ala Pro Asp
245 250 255

ggc acg atc gtc acc cac gac ctt ccg acc ccg gcc gcc gag ccg cac 816
Gly Thr Ile Val Thr His Asp Leu Pro Thr Pro Ala Ala Glu Pro His
260 265 270

ggc atc acc gtc gcc ccc gac ggc acg gtc tgg gcc gcc ctg gaa acg 864
Gly Ile Thr Val Ala Pro Asp Gly Thr Val Trp Ala Ala Leu Glu Thr
275 280 285

ggc gct ctg gcc cac ctg acg ccc tga 891
Gly Ala Leu Ala His Leu Thr Pro *
290 295



<210> 76 
<211> 125 
<212> PRT 
<213> Bacteria 



<400> 76 
Met Ser Thr Thr Thr Asn Ala Val Thr Trp Phe Glu Val Gly Thr Asp
1 5 10 15

Arg Pro Glu Glu Thr Gly Arg Phe Tyr Ala Asp Leu Phe Gly Trp Ala
20 25 30

Phe Gly Glu Gln Gly Thr Pro Glu Ala Ser Tyr Arg Val Thr Glu Pro
35 40 45

Gly Pro Glu Gly Ser Ile Gln Gly Ala Ile Arg Gly Thr Gly Gly Ala
50 55 60

Ser Pro Asn Tyr Ala Ile Phe Tyr Val Gln Val Ala Asp Val Ala Asp
65 70 75 80

Ala Cys Arg Arg Ala Glu Ala Ala Gly Gly Lys Val Leu Val Pro Ala
85 90 95

Lys Ser Thr Asp Asn Gly Leu Thr Phe Ala His Leu Leu Asp Pro Val
100 105 110

Gly Asn His Phe Gly Val Phe Ala Pro Pro Pro Ala Ala
115 120 125



<210> 77 
<211> 1086 






<212> DNA 
<213> Bacteria 



<220> 

<221> CDS 

<222> (1) . . . (1086) 

<400> 77 

gtg acc gcg gcg ggc ccc gag cgc ctc cac cgg cgc ggc ggc ggc ctc 48
Val Thr Ala Ala Gly Pro Glu Arg Leu His Arg Arg Gly Gly Gly Leu
1 5 10 15

ccc cgg ccg acc gga gca ggc gtc cca cgc tcg gcg gag tgg gcc gac 96
Pro Arg Pro Thr Gly Ala Gly Val Pro Arg Ser Ala Glu Trp Ala Asp
20 25 30

cgg tac gtg ggc ggc gcc ccg ccg cga cta ggc tct gcc gct gtg tcc 144
Arg Tyr Val Gly Gly Ala Pro Pro Arg Leu Gly Ser Ala Ala Val Ser
35 40 45

gac cat gcc agc acg act ccc gcc acc gcc gta cga ccg ccg gtg ctg 192
Asp His Ala Ser Thr Thr Pro Ala Thr Ala Val Arg Pro Pro Val Leu
50 55 60

tgc ccc ggc gac acg gtg atg ctg gtg tcg ccg tcg ggg ccg acc cgg 240
Cys Pro Gly Asp Thr Val Met Leu Val Ser Pro Ser Gly Pro Thr Arg
65 70 75 80

ccc gag cgg gtg gcc cgg ggc atc gag ctg ctc acc ggc tgg ggg ctg 288
Pro Glu Arg Val Ala Arg Gly Ile Glu Leu Leu Thr Gly Trp Gly Leu
85 90 95

cgg ccg gtg ctg gcg ccg aac gcg tac gcc cgg cag ggt tac ctg gcc 336
Arg Pro Val Leu Ala Pro Asn Ala Tyr Ala Arg Gln Gly Tyr Leu Ala
100 105 110

ggc gcg gac gag ctg cgc gcc gcc gac ctg aac gcg gcg ttc gcc gac 384
Gly Ala Asp Glu Leu Arg Ala Ala Asp Leu Asn Ala Ala Phe Ala Asp
115 120 125

ccc gag gtg cgc ggg gtg atc tgc acg cgc ggc ggg tac ggc gcg cag 432
Pro Glu Val Arg Gly Val Ile Cys Thr Arg Gly Gly Tyr Gly Ala Gln
130 135 140

cgg atc gtc gac gcg atc gac atg gcc gcc gta cgc cgg gac ccg aag 480
Arg Ile Val Asp Ala Ile Asp Met Ala Ala Val Arg Arg Asp Pro Lys
145 150 155 160

gtg gtc gcc ggg ttc tcc gac atc acc gcg ctg cag ctc gcg ctg tgg 528
Val Val Ala Gly Phe Ser Asp Ile Thr Ala Leu Gln Leu Ala Leu Trp
165 170 175

cgg ggc gcc cgg ctg gcc ggc gtg cac ggc ccc ggg gcg gcg tgg ctg 576
Arg Gly Ala Arg Leu Ala Gly Val His Gly Pro Gly Ala Ala Trp Leu
180 185 190

gac gag cgc act ccg ctg cgg tcg gcc gag tcg ctg cac gcc gcc ctg 624
Asp Glu Arg Thr Pro Leu Arg Ser Ala Glu Ser Leu His Ala Ala Leu
195 200 205

atg acc acc gaa ccg gtg acg gtg acc gcc gtc gcc gag gag gag acg 672
Met Thr Thr Glu Pro Val Thr Val Thr Ala Val Ala Glu Glu Glu Thr
210 215 220

ttc ccg gtg cgg gtg ccc ggg cgg gcc acc ggc ccg ctg ctg ggc ggc 720
Phe Pro Val Arg Val Pro Gly Arg Ala Thr Gly Pro Leu Leu Gly Gly
225 230 235 240

aac ctc tgc ctg gtc gtg gcg tcg ctg ggc acc ccg gac atg ccg gac 768
Asn Leu Cys Leu Val Val Ala Ser Leu Gly Thr Pro Asp Met Pro Asp
245 250 255

ctg acc ggc gcg atc ctg ttg atc gag gac gtg cag gag ccg ccg tac 816
Leu Thr Gly Ala Ile Leu Leu Ile Glu Asp Val Gln Glu Pro Pro Tyr
260 265 270

aag gtg gac cgg atg ctc acc cag ttg cgc cgg gcc ggc gcg ctg gac 864
Lys Val Asp Arg Met Leu Thr Gln Leu Arg Arg Ala Gly Ala Leu Asp
275 280 285

ggg ctg gcc ggg gtg gcg gtc ggc cag ttc acc ggc tgc gcc gac ggc 912
Gly Leu Ala Gly Val Ala Val Gly Gln Phe Thr Gly Cys Ala Asp Gly
290 295 300

tgg tcg acc agc gtc gcc gac gtg ctc tcc gag cgc ctc ggc gac ctc 960
Trp Ser Thr Ser Val Ala Asp Val Leu Ser Glu Arg Leu Gly Asp Leu
305 310 315 320

ggc gtc ccg gtc ctc ggc ggc ctg ccc gtc ggc cac ggc gtc ggc cag 1008
Gly Val Pro Val Leu Gly Gly Leu Pro Val Gly His Gly Val Gly Gln
325 330 335

ctc acc gtc ccg gtc ggc acc gac gcg acc ctc gac acg acg acg gcc 1056
Leu Thr Val Pro Val Gly Thr Asp Ala Thr Leu Asp Thr Thr Thr Ala
340 345 350

acc ctc acg gtc acc ccc gcc gtc cgc tga 1086
Thr Leu Thr Val Thr Pro Ala Val Arg *
355 360



<210> 78 
<211> 361 
<212> PRT 
<213> Bacteria 

<400> 78 

Val Thr Ala Ala Gly Pro Glu Arg Leu His Arg Arg Gly Gly Gly Leu
1 5 10 15

Pro Arg Pro Thr Gly Ala Gly Val Pro Arg Ser Ala Glu Trp Ala Asp
20 25 30

Arg Tyr Val Gly Gly Ala Pro Pro Arg Leu Gly Ser Ala Ala Val Ser
35 40 45

Asp His Ala Ser Thr Thr Pro Ala Thr Ala Val Arg Pro Pro Val Leu
50 55 60

Cys Pro Gly Asp Thr Val Met Leu Val Ser Pro Ser Gly Pro Thr Arg
65 70 75 80

Pro Glu Arg Val Ala Arg Gly Ile Glu Leu Leu Thr Gly Trp Gly Leu
85 90 95

Arg Pro Val Leu Ala Pro Asn Ala Tyr Ala Arg Gln Gly Tyr Leu Ala
100 105 110

Gly Ala Asp Glu Leu Arg Ala Ala Asp Leu Asn Ala Ala Phe Ala Asp
115 120 125

Pro Glu Val Arg Gly Val Ile Cys Thr Arg Gly Gly Tyr Gly Ala Gln
130 135 140

Arg Ile Val Asp Ala Ile Asp Met Ala Ala Val Arg Arg Asp Pro Lys
145 150 155 160

Val Val Ala Gly Phe Ser Asp Ile Thr Ala Leu Gln Leu Ala Leu Trp
165 170 175

Arg Gly Ala Arg Leu Ala Gly Val His Gly Pro Gly Ala Ala Trp Leu
180 185 190

Asp Glu Arg Thr Pro Leu Arg Ser Ala Glu Ser Leu His Ala Ala Leu
195 200 205

Met Thr Thr Glu Pro Val Thr Val Thr Ala Val Ala Glu Glu Glu Thr
210 215 220

Phe Pro Val Arg Val Pro Gly Arg Ala Thr Gly Pro Leu Leu Gly Gly
225 230 235 240

Asn Leu Cys Leu Val Val Ala Ser Leu Gly Thr Pro Asp Met Pro Asp
245 250 255

Leu Thr Gly Ala Ile Leu Leu Ile Glu Asp Val Gln Glu Pro Pro Tyr
260 265 270

Lys Val Asp Arg Met Leu Thr Gln Leu Arg Arg Ala Gly Ala Leu Asp
275 280 285

Gly Leu Ala Gly Val Ala Val Gly Gln Phe Thr Gly Cys Ala Asp Gly
290 295 300

Trp Ser Thr Ser Val Ala Asp Val Leu Ser Glu Arg Leu Gly Asp Leu
305 310 315 320

Gly Val Pro Val Leu Gly Gly Leu Pro Val Gly His Gly Val Gly Gln
325 330 335

Leu Thr Val Pro Val Gly Thr Asp Ala Thr Leu Asp Thr Thr Thr Ala
340 345 350

Thr Leu Thr Val Thr Pro Ala Val Arg
355 360



<210> 79 
<211> 861 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1) . . . (861) 

<400> 79 

gtg gga cgc ctg ctc cgg tcg gcc ggg gga ggc cgc cgc cgc gcc ggt 48
Val Gly Arg Leu Leu Arg Ser Ala Gly Gly Gly Arg Arg Arg Ala Gly
1 5 10 15



gga ggc gct cgg ggc ccg ccg cgg tca ccg gct agc ctc gac gtc gtg 96
Gly Gly Ala Arg Gly Pro Pro Arg Ser Pro Ala Ser Leu Asp Val Val
20 25 30

gct acc gcg ttg gtg atc gag aac gac ccg acc gac gac gtc cgc cgg 144
Ala Thr Ala Leu Val Ile Glu Asn Asp Pro Thr Asp Asp Val Arg Arg
35 40 45

ctc ggc gag tgg ctg acc gag gcg ggt ctc gac ctg tgg gtc gtc cgc 192
Leu Gly Glu Trp Leu Thr Glu Ala Gly Leu Asp Leu Trp Val Val Arg
50 55 60

gcg cac gcc ggc gat cag ctc ccc gcc gac ctg gag ggc tac tcc gcg 240
Ala His Ala Gly Asp Gln Leu Pro Ala Asp Leu Glu Gly Tyr Ser Ala
65 70 75 80

ctg gtg gtg ctg ggc ggc gag cag cag gcg tac ccg ctg ccc gac ggc 288
Leu Val Val Leu Gly Gly Glu Gln Gln Ala Tyr Pro Leu Pro Asp Gly
85 90 95

tcg ccc ggc gcg ccc tgg ttc ccc gcc gtc gag ggg ctg ctc cgc aag 336
Ser Pro Gly Ala Pro Trp Phe Pro Ala Val Glu Gly Leu Leu Arg Lys
100 105 110

gcc gtc cgg gac cgg gtg ccc acc ctg ggc atc tgc ctg ggc gcg cag 384
Ala Val Arg Asp Arg Val Pro Thr Leu Gly Ile Cys Leu Gly Ala Gln
115 120 125

ttg ctg gcg acc gcc cac gcc ggc gag gtc gag cgc agc gcg tcc ggg 432
Leu Leu Ala Thr Ala His Ala Gly Glu Val Glu Arg Ser Ala Ser Gly
130 135 140

ccg gag gtc ggg ccc ggt gtg gtc ggc aag cgg gac gcc gcc gac gcc 480
Pro Glu Val Gly Pro Gly Val Val Gly Lys Arg Asp Ala Ala Asp Ala
145 150 155 160

gac ccg ctg ttc cgc tac gtc ccg ctg atc ccc gac gtg ctc cag tgg 528
Asp Pro Leu Phe Arg Tyr Val Pro Leu Ile Pro Asp Val Leu Gln Trp
165 170 175

cac gcc gac gag atc acc gag ctg ccc cgg ggc gcc acc ctg ctg gcc 576
His Ala Asp Glu Ile Thr Glu Leu Pro Arg Gly Ala Thr Leu Leu Ala
180 185 190

gcc tcc acc cgc tac ccg cac cag gcg ttc cgc ctc ggc gac cgg gcc 624
Ala Ser Thr Arg Tyr Pro His Gln Ala Phe Arg Leu Gly Asp Arg Ala
195 200 205

tgg ggg ctg cag ttc cac atc gag tgc gac acc gcg atg atc gcc gac 672
Trp Gly Leu Gln Phe His Ile Glu Cys Asp Thr Ala Met Ile Ala Asp
210 215 220

tgg gcc acc gac tcg acg ctg ctg gcc gag ctg ggc tac gac ccg gac 720
Trp Ala Thr Asp Ser Thr Leu Leu Ala Glu Leu Gly Tyr Asp Pro Asp
225 230 235 240

ctg gtg gtg gcg gcc tgc cac gcg gtg atg gtc gac gtc gag gag gtc 768
Leu Val Val Ala Ala Cys His Ala Val Met Val Asp Val Glu Glu Val
245 250 255

tgg cag ccg ttc gcc gcc cgg ttc gcc gcg ctg gct ctc ggc gag ctg 816
Trp Gln Pro Phe Ala Ala Arg Phe Ala Ala Leu Ala Leu Gly Glu Leu
260 265 270

gac gac gac acg tcc cgc cgc agc ctg ccg ctg ctc ggg cag tga 861
Asp Asp Asp Thr Ser Arg Arg Ser Leu Pro Leu Leu Gly Gln *
275 280 285



<210> 80 

<211> 286 

<212> PRT 

<213> Bacteria 

<400> 80 
Val Gly Arg Leu Leu Arg Ser Ala Gly Gly Gly Arg Arg Arg Ala Gly
1 5 10 15

Gly Gly Ala Arg Gly Pro Pro Arg Ser Pro Ala Ser Leu Asp Val Val
20 25 30

Ala Thr Ala Leu Val Ile Glu Asn Asp Pro Thr Asp Asp Val Arg Arg
35 40 45

Leu Gly Glu Trp Leu Thr Glu Ala Gly Leu Asp Leu Trp Val Val Arg
50 55 60

Ala His Ala Gly Asp Gln Leu Pro Ala Asp Leu Glu Gly Tyr Ser Ala
65 70 75 80

Leu Val Val Leu Gly Gly Glu Gln Gln Ala Tyr Pro Leu Pro Asp Gly
85 90 95

Ser Pro Gly Ala Pro Trp Phe Pro Ala Val Glu Gly Leu Leu Arg Lys
100 105 110

Ala Val Arg Asp Arg Val Pro Thr Leu Gly Ile Cys Leu Gly Ala Gln
115 120 125

Leu Leu Ala Thr Ala His Ala Gly Glu Val Glu Arg Ser Ala Ser Gly
130 135 140

Pro Glu Val Gly Pro Gly Val Val Gly Lys Arg Asp Ala Ala Asp Ala
145 150 155 160

Asp Pro Leu Phe Arg Tyr Val Pro Leu Ile Pro Asp Val Leu Gln Trp
165 170 175

His Ala Asp Glu Ile Thr Glu Leu Pro Arg Gly Ala Thr Leu Leu Ala
180 185 190

Ala Ser Thr Arg Tyr Pro His Gln Ala Phe Arg Leu Gly Asp Arg Ala
195 200 205

Trp Gly Leu Gln Phe His Ile Glu Cys Asp Thr Ala Met Ile Ala Asp
210 215 220

Trp Ala Thr Asp Ser Thr Leu Leu Ala Glu Leu Gly Tyr Asp Pro Asp
225 230 235 240

Leu Val Val Ala Ala Cys His Ala Val Met Val Asp Val Glu Glu Val
245 250 255

Trp Gln Pro Phe Ala Ala Arg Phe Ala Ala Leu Ala Leu Gly Glu Leu
260 265 270

Asp Asp Asp Thr Ser Arg Arg Ser Leu Pro Leu Leu Gly Gln
275 280 285



<210> 81 
<211> 3033 
<212> DNA 
<213> Bacteria 

<220> 
<221> CDS
<222> (1) . . . (3033)



<400> 81 

gtg atg agc cgg ccg acc agc gcc gcc gga cgg ctc gcc cgc tac ggt 48
Val Met Ser Arg Pro Thr Ser Ala Ala Gly Arg Leu Ala Arg Tyr Gly
1 5 10 15

ttc ggc atc gtc ggc ggc gac ggc gcc acc cgc gcc gcc gac ctg ctc 96
Phe Gly Ile Val Gly Gly Asp Gly Ala Thr Arg Ala Ala Asp Leu Leu
20 25 30

ggc ccc gac ggg ctg ggc ctg tgg cgg ccg gac gtg cag gag ccg acc 144
Gly Pro Asp Gly Leu Gly Leu Trp Arg Pro Asp Val Gln Glu Pro Thr
35 40 45

gac gac cgc gcg gcg gag ctg ctc gcc gcg ctc tcc cgg gcc gcc gac 192
Asp Asp Arg Ala Ala Glu Leu Leu Ala Ala Leu Ser Arg Ala Ala Asp
50 55 60

ccg gac ctg gcg ctg cgc cag ctc cac cgc atc gtc gag gcg gag cgc 240
Pro Asp Leu Ala Leu Arg Gln Leu His Arg Ile Val Glu Ala Glu Arg
65 70 75 80

cgg gcc gcc ggt ccg gcg gcc acc ggt tcg gcg ctg gtg gag gcg ctc 288
Arg Ala Ala Gly Pro Ala Ala Thr Gly Ser Ala Leu Val Glu Ala Leu
85 90 95

gcg gac gac ccg ggg ctg cgc cgc cgg ctg atc gcc gtc ctc ggc gcc 336
Ala Asp Asp Pro Gly Leu Arg Arg Arg Leu Ile Ala Val Leu Gly Ala
100 105 110

tcc tcg gcg ctg ggc gac cac ctg gtc gcc aac ccc gac cag tgg ccg 384
Ser Ser Ala Leu Gly Asp His Leu Val Ala Asn Pro Asp Gln Trp Pro
115 120 125

gcc ctg cgg acc gcc ccg gac ggg ctc gcg ccg acc gcg gag ggc cgg 432
Ala Leu Arg Thr Ala Pro Asp Gly Leu Ala Pro Thr Ala Glu Gly Arg
130 135 140

ctc gac ctg tcc ggc gac ggg cag ccg gtc gcg gtg ctg cgc aag gcg 480
Leu Asp Leu Ser Gly Asp Gly Gln Pro Val Ala Val Leu Arg Lys Ala
145 150 155 160

tac cgg ctg gcg ctg ctg cgg atc gcg gcg gcc gac ctg acc ggc gac 528
Tyr Arg Leu Ala Leu Leu Arg Ile Ala Ala Ala Asp Leu Thr Gly Asp
165 170 175

cgg ggc ctg gag cag acg atg gcc gcg ctc tcc gcg ttg gcc gac gcg 576
Arg Gly Leu Glu Gln Thr Met Ala Ala Leu Ser Ala Leu Ala Asp Ala
180 185 190

acc ctg gcg gcg gcg tac gag atc gcc gtc ggc gag ctg ccg gag ggc 624
Thr Leu Ala Ala Ala Tyr Glu Ile Ala Val Gly Glu Leu Pro Glu Gly
195 200 205

acg ccc cgg ccc cgg ctc gcc gtc gtg gcg atg ggc aag tgc ggc ggt 672
Thr Pro Arg Pro Arg Leu Ala Val Val Ala Met Gly Lys Cys Gly Gly
210 215 220
gac gag ctg aac tac gtc tcc gac gtc gac gtg atc ttc gtg gcc gcc 720
Asp Glu Leu Asn Tyr Val Ser Asp Val Asp Val Ile Phe Val Ala Ala
225 230 235 240

gag gac gac gac ctc gcc gcg gcc acc acg gtc gcc acc cgg ctg atc 768
Glu Asp Asp Asp Leu Ala Ala Ala Thr Thr Val Ala Thr Arg Leu Ile
245 250 255

cac gtc tgc ggg ctg gtc gcc tgg ccg gtc gac gcc gcc ctg cgg ccc 816
His Val Cys Gly Leu Val Ala Trp Pro Val Asp Ala Ala Leu Arg Pro
260 265 270

gag ggc aat cgt ggc ccg ctg gtg cgc acc ctg gcc agc cac ctc gcc 864
Glu Gly Asn Arg Gly Pro Leu Val Arg Thr Leu Ala Ser His Leu Ala
275 280 285

tac tac cgg cgc tgg gcg cgg acg tgg gag ttc cag gcg ctg ctc aag 912
Tyr Tyr Arg Arg Trp Ala Arg Thr Trp Glu Phe Gln Ala Leu Leu Lys
290 295 300

gcc cgg ccg gcg gcc ggc gac ctg acc ctg ggc cgg gag tgg atc gac 960
Ala Arg Pro Ala Ala Gly Asp Leu Thr Leu Gly Arg Glu Trp Ile Asp
305 310 315 320

cag ctc gcc ccg ctc gtg tgg cgg gcc gcc gag cgc ccc gag gcg gtc 1008
Gln Leu Ala Pro Leu Val Trp Arg Ala Ala Glu Arg Pro Glu Ala Val
325 330 335

gag gac gtc cgc gcc atg cgg cgg aag atc atc gac aac gtc ccg ccg 1056
Glu Asp Val Arg Ala Met Arg Arg Lys Ile Ile Asp Asn Val Pro Pro
340 345 350

aag gag ttg gag cgc gag atc aag cgc ggc ccg ggc ggg ctg cgc gac 1104
Lys Glu Leu Glu Arg Glu Ile Lys Arg Gly Pro Gly Gly Leu Arg Asp
355 360 365

atc gag ttc gcc gtc cag ctg ctg caa ctg gtg cac ggc cgg ggc gac 1152
Ile Glu Phe Ala Val Gln Leu Leu Gln Leu Val His Gly Arg Gly Asp
370 375 380

gag tcg ctg cgg acg ccc ggc acc gtc ccg gcg ctg cgc gcg ctc gtc 1200
Glu Ser Leu Arg Thr Pro Gly Thr Val Pro Ala Leu Arg Ala Leu Val
385 390 395 400

gcc ggc ggc tac gtc ggc cgg gcc gac ggg gag gcg ctg ctg cgc ggc 1248
Ala Gly Gly Tyr Val Gly Arg Ala Asp Gly Glu Ala Leu Leu Arg Gly
405 410 415

tac cgc ttc ctg cgc ggc gtc gag cac cgc ctc cag ctc cag ggg ctg 1296
Tyr Arg Phe Leu Arg Gly Val Glu His Arg Leu Gln Leu Gln Gly Leu
420 425 430

cgc cgc acc cac acc gtg ccg acc gag ccg gcc gcg ctg cgc tgg ttg 1344
Arg Arg Thr His Thr Val Pro Thr Glu Pro Ala Ala Leu Arg Trp Leu
435 440 445

gcc gcc gcg ctg ggc tac gcg gcc acg ccg ggc cgc agc gcc gtc gag 1392
Ala Ala Ala Leu Gly Tyr Ala Ala Thr Pro Gly Arg Ser Ala Val Glu
450 455 460

gag ttc cgc gcc gag tgg gtc acc cac gcc acc gag gta cgc cgg ctg 1440
Glu Phe Arg Ala Glu Trp Val Thr His Ala Thr Glu Val Arg Arg Leu
465 470 475 480

cac gcc aag ctg ctc tac cgg ccg ctg ctg gag tcg gtg gcc cgg gtg 1488
His Ala Lys Leu Leu Tyr Arg Pro Leu Leu Glu Ser Val Ala Arg Val
485 490 495

ccg gcc gac ggg ctg cgg ctg acc ccg gag gcg gcc cgg cac cgg ctg 1536
Pro Ala Asp Gly Leu Arg Leu Thr Pro Glu Ala Ala Arg His Arg Leu
500 505 510

gag atc ctc ggc ttc gcc gac ccc gcc ggg gcg ctg cgg cac ctc cag 1584
Glu Ile Leu Gly Phe Ala Asp Pro Ala Gly Ala Leu Arg His Leu Gln
515 520 525

gcc ctc acc ggc ggg gtg agc cgc acg gcg gcc atc cag cgc acc ctg 1632
Ala Leu Thr Gly Gly Val Ser Arg Thr Ala Ala Ile Gln Arg Thr Leu
530 535 540

ctg ccg gtg ctg ctc agc gag ttc gcc gac gcc ccc gag ccg gac cgc 1680
Leu Pro Val Leu Leu Ser Glu Phe Ala Asp Ala Pro Glu Pro Asp Arg
545 550 555 560

ggc ctg ctc aac tac cgg cag gtc tcc gac aag ctc ggc agc acg ccc 1728
Gly Leu Leu Asn Tyr Arg Gln Val Ser Asp Lys Leu Gly Ser Thr Pro
565 570 575

tgg tac ctg cgc ctg ctg cgc gac tcc ggg ccg gtg gcc cgc cgg ctg 1776
Trp Tyr Leu Arg Leu Leu Arg Asp Ser Gly Pro Val Ala Arg Arg Leu
580 585 590

gcc cgg gtg ctc tcc tcc tcc cgc tac gcc gcc gac ctg ctg gcc cgc 1824
Ala Arg Val Leu Ser Ser Ser Arg Tyr Ala Ala Asp Leu Leu Ala Arg
595 600 605

gag ccg gag gcg ctg cgg atg ctg gcc gag gag agc gag ttg acc ccc 1872
Glu Pro Glu Ala Leu Arg Met Leu Ala Glu Glu Ser Glu Leu Thr Pro
610 615 620

cgg ccg agc ggg gtg ctc tgc gag ggc ttc gcc gcc gcc gca gcc cgg 1920
Arg Pro Ser Gly Val Leu Cys Glu Gly Phe Ala Ala Ala Ala Ala Arg
625 630 635 640

cac gcc gac ccc gtc gaa gcc acc cgg gcg atc cgc gcg ctg cgc cgc 1968
His Ala Asp Pro Val Glu Ala Thr Arg Ala Ile Arg Ala Leu Arg Arg
645 650 655

cgg gag ctg gtc cgc atc gcc tgc gcg gac ctg ttg agc cgg gcc ggc 2016
Arg Glu Leu Val Arg Ile Ala Cys Ala Asp Leu Leu Ser Arg Ala Gly
660 665 670

tcg ctg gcc ccg tcg ccg ccc cgg ccc gac ggc ggg cgg gcc gcg ctc 2064
Ser Leu Ala Pro Ser Pro Pro Arg Pro Asp Gly Gly Arg Ala Ala Leu
675 680 685
ggt ctc gcc gac gtc gcc gcc gtg ggc acg gcg ctg gcc gac gtc acc 2112
Gly Leu Ala Asp Val Ala Ala Val Gly Thr Ala Leu Ala Asp Val Thr
690 695 700

gac gcc acc ctg gcc gcg gcg ctg cgg gcc gcc cgg gcc gcc cag ccg 2160
Asp Ala Thr Leu Ala Ala Ala Leu Arg Ala Ala Arg Ala Ala Gln Pro
705 710 715 720

ccc atg ccg ggg ctg cgc ttc gcc gtg atc ggc atg ggc cgc ctg ggc 2208
Pro Met Pro Gly Leu Arg Phe Ala Val Ile Gly Met Gly Arg Leu Gly
725 730 735

ggg tac gag tcg aac tac ctc tcc gac gcc gac gtg ctc ttc gtc tac 2256
Gly Tyr Glu Ser Asn Tyr Leu Ser Asp Ala Asp Val Leu Phe Val Tyr
740 745 750

gac ccc ccg ccc ggc gcc ggc gag agc gcg gcc ggc gcg gcg agc gcc 2304
Asp Pro Pro Pro Gly Ala Gly Glu Ser Ala Ala Gly Ala Ala Ser Ala
755 760 765

gcc gcc cac ggg atc gcc gag gag ttg cgt cgg ctg ctc ggc atg ccc 2352
Ala Ala His Gly Ile Ala Glu Glu Leu Arg Arg Leu Leu Gly Met Pro
770 775 780

gcg ccc gac ccg ccg ctg ggc gtg gac gcc gac ctg cgt ccc gag ggc 2400
Ala Pro Asp Pro Pro Leu Gly Val Asp Ala Asp Leu Arg Pro Glu Gly
785 790 795 800

cgg cag ggt ccg ctc gtg cgc agc ctc gcc gcg tac gcg cag tac tac 2448
Arg Gln Gly Pro Leu Val Arg Ser Leu Ala Ala Tyr Ala Gln Tyr Tyr
805 810 815

gcc cgc tgg tcg aag gtg tgg gag gcg cag gcg ctg ctg cgt gcc cgg 2496
Ala Arg Trp Ser Lys Val Trp Glu Ala Gln Ala Leu Leu Arg Ala Arg
820 825 830

ttc gtc tgc ggc gac gcc gac ctc ggc gcg gag ttc gag gcg atg gtc 2544
Phe Val Cys Gly Asp Ala Asp Leu Gly Ala Glu Phe Glu Ala Met Val
835 840 845

gac ccg gtc cgc tac ccg gcc gac ggg ttg acc cgc gag cag gtg gtg 2592
Asp Pro Val Arg Tyr Pro Ala Asp Gly Leu Thr Arg Glu Gln Val Val
850 855 860

gag atc cgg cgg atc aag gcg cgg gtg gag cac gag cgg ctg ccc cgg 2640
Glu Ile Arg Arg Ile Lys Ala Arg Val Glu His Glu Arg Leu Pro Arg
865 870 875 880

ggc gcc gac ccg gcc acc cac acc aag ctc ggg cgg ggc ggc ctc gcc 2688
Gly Ala Asp Pro Ala Thr His Thr Lys Leu Gly Arg Gly Gly Leu Ala
885 890 895

gac gtc gag tgg gcg gtg caa ctg ctc cag ctc cgg cac gcc ggg acg 2736
Asp Val Glu Trp Ala Val Gln Leu Leu Gln Leu Arg His Ala Gly Thr
900 905 910

gtc ccg cgg ctg cgc ggc acg cgt acg ctc gac gcc ctc gcg gcg gcc 2784
Val Pro Arg Leu Arg Gly Thr Arg Thr Leu Asp Ala Leu Ala Ala Ala
915 920 925

cgg gac gcg ggg ctg gtc gac ccg acg gac gcc acc gag atg gcg gcc 2832
Arg Asp Ala Gly Leu Val Asp Pro Thr Asp Ala Thr Glu Met Ala Ala
930 935 940

ggc tgg acc ctg gcc gcg cag gtc cgc aac gcg ctg atg ctg gtc cgc 2880
Gly Trp Thr Leu Ala Ala Gln Val Arg Asn Ala Leu Met Leu Val Arg
945 950 955 960

ggc cgg gcc ggc gac cag ttg ccc cgg cac ggc gtc gag ttg gcc ggg 2928
Gly Arg Ala Gly Asp Gln Leu Pro Arg His Gly Val Glu Leu Ala Gly
965 970 975

gtg gtc cgg ctg ctc ggc cgg gac gat ccc ggc gag ttc ctc gac gag 2976
Val Val Arg Leu Leu Gly Arg Asp Asp Pro Gly Glu Phe Leu Asp Glu
980 985 990

tac ctg cgc acc ggc cgc cgc tcc cgc gcg gcg atg gag cgg gtc ctc 3024
Tyr Leu Arg Thr Gly Arg Arg Ser Arg Ala Ala Met Glu Arg Val Leu
995 1000 1005

gac gcc tga 3033
Asp Ala *
1010



<210> 82 
<211> 296 
<212> PRT 
<213> Bacteria 



<400> 82 
Val Ile Leu Val Ala Leu Ala Lys Ile Arg Glu Val Pro Leu Thr Gly
1 5 10 15

Ala Asp Ala Gly Pro Tyr Gly Val Thr Val Gly Pro Asp Gly Ala Leu
20 25 30

Trp Leu Thr Leu Val His Ala Gly Ala Val Ala Arg Val Gly Ala Asp
35 40 45

Gly Asp Leu Arg Thr Trp Gln Val Ala Ala Asp Ser Arg Pro Leu Ile
50 55 60

Val Thr Pro Gly Pro Asp Gly Ala Leu Trp Phe Thr Arg Ser Gly Asp
65 70 75 80

Asp Arg Ile Gly Arg Ile Thr Thr Asp Gly Glu Gln Ser Ala Val Ala
85 90 95

Leu Pro Pro Gly Ser Gly Pro Cys Gly Ile Ala Ala Gly Pro Asp Gly
100 105 110

Ala Leu Trp Tyr Ala Ala Met Thr Ala Asp Ala Val Gly Arg Val Thr
115 120 125

Thr Asp Gly Lys Val Thr Gln Phe Pro Leu Pro Val Ser Gly Gly Phe
130 135 140

Ala Ser Met Val Ala Ala Gly Pro Asp Glu Ala Val Trp Phe Thr Leu
145 150 155 160

Asn Gln Ala Asn Ala Val Gly Arg Ile Gly Thr Asp Gly Ala Val Ala
165 170 175

Leu His Pro Leu Pro Thr Glu Gly Ala Ala Pro Val Gly Ile Thr Ala
180 185 190

Gly Ala Asp Gly Ala Leu Trp Phe Val Glu Ile Gly Ala Gly Gln Leu
195 200 205

Gly Arg Ile Thr Pro Asp Gly Arg Ile Asp Glu Tyr Pro Leu Pro Asp
210 215 220

Arg Ala Ala Arg Pro His Ala Ile Val Ala Asp Pro Ala Gly Gly Cys
225 230 235 240

Trp Phe Thr Glu Trp Gly Gly Asn Arg Ile Gly His Val Ala Pro Asp
245 250 255

Gly Thr Ile Val Thr His Asp Leu Pro Thr Pro Ala Ala Glu Pro His
260 265 270

Gly Ile Thr Val Ala Pro Asp Gly Thr Val Trp Ala Ala Leu Glu Thr
275 280 285

Gly Ala Leu Ala His Leu Thr Pro
290 295



<210> 83
<211> 705
<212> DNA
<213> Bacteria

<220>
<221> CDS
<222> (1) . . . (705)

<400> 83

gtg cga cac gac gga acg gcc ggg gag cac cgg cac gac agg acg gcg 48
Val Arg His Asp Gly Thr Ala Gly Glu His Arg His Asp Arg Thr Ala
1 5 10 15

gcg ccg gtg gac gac cat tgg cgg cac ccg gac gtg gac gag gag acc 96
Ala Pro Val Asp Asp His Trp Arg His Pro Asp Val Asp Glu Glu Thr
20 25 30

gct cgg tac tgg gag gag ctc tac ggg cgg cgc gac cgg tac tgg agc 144
Ala Arg Tyr Trp Glu Glu Leu Tyr Gly Arg Arg Asp Arg Tyr Trp Ser
35 40 45

ggg cgg gcg aac ccg atc ctg gtc gac gtc gcc ggg ccg ctg ccg gcc 192
Gly Arg Ala Asn Pro Ile Leu Val Asp Val Ala Gly Pro Leu Pro Ala
50 55 60

ggc acc gcg ctg gac ctc ggc tgc ggc gag ggc ggc gac gcg atc tgg 240
Gly Thr Ala Leu Asp Leu Gly Cys Gly Glu Gly Gly Asp Ala Ile Trp
65 70 75 80

ctg gcc ggg cgg ggc tgg cgg gtg acg gcg gtg gac gtc gcc gag acc 288
Leu Ala Gly Arg Gly Trp Arg Val Thr Ala Val Asp Val Ala Glu Thr
85 90 95

gcc ctc gac cgg gcg tcc gca gcg gcg gcc gag gcc ggg gtg gcg tcc 336
Ala Leu Asp Arg Ala Ser Ala Ala Ala Ala Glu Ala Gly Val Ala Ser
100 105 110

cgc atc gag ttc cgc cgg cac gac ctc acc cgg acc ttc ccg ccg ggc 384
Arg Ile Glu Phe Arg Arg His Asp Leu Thr Arg Thr Phe Pro Pro Gly
115 120 125

gag ttc gac ctg gtc tcc gcg cag ttc ctc cag tcg ccg ctg gag ttc 432
Glu Phe Asp Leu Val Ser Ala Gln Phe Leu Gln Ser Pro Leu Glu Phe
130 135 140

ccc cgg gga gag gtg ctg cgc tcg gcg gcc cgg gcc gtg gcc ccc ggc 480
Pro Arg Gly Glu Val Leu Arg Ser Ala Ala Arg Ala Val Ala Pro Gly
145 150 155 160

ggc cgg ctg ctc gtc gtc gag cac gg? gag gtc ccg ccg tgg gga cgg 528
Gly Arg Leu Leu Val Val Glu His Gly Glu Val Pro Pro Trp Gly Arg
165 170 175

cac gcg cac ccg gac gtg cgc ttc ccc acc ccg cag gag acc ctc gcc 576
His Ala His Pro Asp Val Arg Phe Pro Thr Pro Gln Glu Thr Leu Ala
180 185 190

gag ctg gac ctc gac ccg gac cgg tgg ctc acc gag cgg ctc gac gcc 624
Glu Leu Asp Leu Asp Pro Asp Arg Trp Leu Thr Glu Arg Leu Asp Ala
195 200 205

ccg cgc cgg cag gcc acc ggc ccg gac gg? cat acc ggg acc ctc gtc 672
Pro Arg Arg Gln Ala Thr Gly Pro Asp Gly His Thr Gly Thr Leu Val
210 215 220

gac cac gtg gtg ctg gtc cgc cgc cgc ccg tag 705
Asp His Val Val Leu Val Arg Arg Arg Pro *
225 230

<210> 84
<211> 234
<212> PRT
<213> Bacteria

<400> 84
Val Arg His Asp Gly Thr Ala Gly Glu His Arg His Asp Arg Thr Ala
1 5 10 15

Ala Pro Val Asp Asp His Trp Arg His Pro Asp Val Asp Glu Glu Thr
20 25 30

Ala Arg Tyr Trp Glu Glu Leu Tyr Gly Arg Arg Asp Arg Tyr Trp Ser
35 40 45

Gly Arg Ala Asn Pro Ile Leu Val Asp Val Ala Gly Pro Leu Pro Ala
50 55 60

Gly Thr Ala Leu Asp Leu Gly Cys Gly Glu Gly Gly Asp Ala Ile Trp
65 70 75 80

Leu Ala Gly Arg Gly Trp Arg Val Thr Ala Val Asp Val Ala Glu Thr
85 90 95

Ala Leu Asp Arg Ala Ser Ala Ala Ala Ala Glu Ala Gly Val Ala Ser
100 105 110

Arg Ile Glu Phe Arg Arg His Asp Leu Thr Arg Thr Phe Pro Pro Gly
115 120 125

Glu Phe Asp Leu Val Ser Ala Gln Phe Leu Gln Ser Pro Leu Glu Phe
130 135 140

Pro Arg Gly Glu Val Leu Arg Ser Ala Ala Arg Ala Val Ala Pro Gly
145 150 155 160

Gly Arg Leu Leu Val Val Glu His Gly Glu Val Pro Pro Trp Gly Arg
165 170 175

His Ala His Pro Asp Val Arg Phe Pro Thr Pro Gln Glu Thr Leu Ala
180 185 190

Glu Leu Asp Leu Asp Pro Asp Arg Trp Leu Thr Glu Arg Leu Asp Ala
195 200 205

Pro Arg Arg Gln Ala Thr Gly Pro Asp Gly His Thr Gly Thr Leu Val
210 215 220

Asp His Val Val Leu Val Arg Arg Arg Pro
225 230



<210> 85 
<211> 1320 
<212> DNA 
<213> Bacteria



<220> 
<221> CDS 

<222> (1) . . . (1320) 



<400> 85 

gtg ggc atg cgg cgc agt cgg gtg gtg gcg gtg gcc gcc gcg tcc gcc 48
Val Gly Met Arg Arg Ser Arg Val Val Ala Val Ala Ala Ala Ser Ala
1 5 10 15

gtg ctc ctc ggc gtg acg tat ctc gct ctt ccg ccg acc ggt tcc gac 96
Val Leu Leu Gly Val Thr Tyr Leu Ala Leu Pro Pro Thr Gly Ser Asp
20 25 30

ctc gcc gcg cag gtc gcc cgg gcc gac ttc ttc gcc gcc cac ggc ctc 144
Leu Ala Ala Gln Val Ala Arg Ala Asp Phe Phe Ala Ala His Gly Leu
35 40 45

gcc ccg gtc gac ctg cgc tgg tac ggc ggg gtc cag cag ttc ggc tac 192
Ala Pro Val Asp Leu Arg Trp Tyr Gly Gly Val Gln Gln Phe Gly Tyr
50 55 60

agc ctg gtc tcc cag ccg gtg atg gcg ctg ctc ggg gtg cgg gtc acc 240
Ser Leu Val Ser Gln Pro Val Met Ala Leu Leu Gly Val Arg Val Thr
65 70 75 80

ggc gtg ctg gcg ctg gtg gcg gcg gcg acc gcg ttc gcg gcg ctg ctg 288
Gly Val Leu Ala Leu Val Ala Ala Ala Thr Ala Phe Ala Ala Leu Leu
85 90 95

gtg cgc acc ggg gtg ccg cgc ccg ctg ctc ggc agc ctg gtc ggg gtg 336
Val Arg Thr Gly Val Pro Arg Pro Leu Leu Gly Ser Leu Val Gly Val
100 105 110

gtc acg atc gcc ggc aac ctg gtc tcg ggc cgg gtg acg tac ggc ctc 384
Val Thr Ile Ala Gly Asn Leu Val Ser Gly Arg Val Thr Tyr Gly Leu
115 120 125

ggg gtg gcc ttc ggc ctc ggc gcg ctg ctc gcc ctc acc ctc ccc cgc 432
Gly Val Ala Phe Gly Leu Gly Ala Leu Leu Ala Leu Thr Leu Pro Arg
130 135 140

ggc ccg gcc gca cgg gcc gcc gac tcc gac ccg gcc gca ccg gcc gac 480
Gly Pro Ala Ala Arg Ala Ala Asp Ser Asp Pro Ala Ala Pro Ala Asp
145 150 155 160
tcc gac gcc gac ggg cgg gcg cgg cga cgg cag gtc gcg cgg ctc ggg 528
Ser Asp Ala Asp Gly Arg Ala Arg Arg Arg Gln Val Ala Arg Leu Gly
165 170 175

ctg gcg gtc gcc ggg gcg ctg ctg gcc tcg gcg gcg agc ccg gtg gcg 576
Leu Ala Val Ala Gly Ala Leu Leu Ala Ser Ala Ala Ser Pro Val Ala
180 185 190

ggc ctc ttc gtc ggc ctg gcc ggc gcg gcg ctg ctg ctc acc cgc cgg 624
Gly Leu Phe Val Gly Leu Ala Gly Ala Ala Leu Leu Leu Thr Arg Arg
195 200 205

tac gcc gac ggc ctg gcg ctc ggc gtc gcc gcc gcg ctg ccg ctc ggg 672
Tyr Ala Asp Gly Leu Ala Leu Gly Val Ala Ala Ala Leu Pro Leu Gly
210 215 220

gcg acc gcg ctg ctc ttc ggc gac ggc ggc tgg atg aac atc agc cgc 720
Ala Thr Ala Leu Leu Phe Gly Asp Gly Gly Trp Met Asn Ile Ser Arg
225 230 235 240

acc gac acg ctg cgc gcc gtg ctg acc agc ctg ctg gtc gcc gcg ctg 768
Thr Asp Thr Leu Arg Ala Val Leu Thr Ser Leu Leu Val Ala Ala Leu
245 250 255

gtg gcg tac cgg ccg gtg cgg gtg ggc gcg ctg ctc tcg gcg gcc ggg 816
Val Ala Tyr Arg Pro Val Arg Val Gly Ala Leu Leu Ser Ala Ala Gly
260 265 270

gtg ctg gcg gcg gcg ctg gtg cac acc ccg gtc ggg ctg aac gcc acc 864
Val Leu Ala Ala Ala Leu Val His Thr Pro Val Gly Leu Asn Ala Thr
275 280 285

cgg ctg gcg gtc atg ttc ggc ctg ccg ctg ctg gcc gcc gcc gcc cgc 912
Arg Leu Ala Val Met Phe Gly Leu Pro Leu Leu Ala Ala Ala Ala Arg
290 295 300

ccc ccg gtc ggg ctg gcg cgg tgg tgg gcc cga cgc ggg cgg ggc gcg 960
Pro Pro Val Gly Leu Ala Arg Trp Trp Ala Arg Arg Gly Arg Gly Ala
305 310 315 320

gcg cgg ggc ggg gtg ggc ggc cgg gac gcg gcg cag ggg cgg agc aag 1008
Ala Arg Gly Gly Val Gly Gly Arg Asp Ala Ala Gln Gly Arg Ser Lys
325 330 335

gtc cgg ggc cgc gtg gcg ctg gcc acg ctg ctg gcg gcc ggc tgc tgg 1056
Val Arg Gly Arg Val Ala Leu Ala Thr Leu Leu Ala Ala Gly Cys Trp
340 345 350

tgg cag ccg ccg gtg ccc ccc gcc gac ctg cgc agc gtc gac gac ccg 1104
Trp Gln Pro Pro Val Pro Pro Ala Asp Leu Arg Ser Val Asp Asp Pro
355 360 365

acc ggc cgg gcc gcg tac ttg cgc cgc tgc ggg agt tcc tcg acg ggc 1152
Thr Gly Arg Ala Ala Tyr Leu Arg Arg Cys Gly Ser Ser Ser Thr Gly
370 375 380

agc ggc tca ccg gcc ggg tcg agg tgc cgc cga ccc gca act act ggg 1200
Ser Gly Ser Pro Ala Gly Ser Arg Cys Arg Arg Pro Ala Thr Thr Gly
385 390 395 400

agg cgg cgc ggc tgg gcg agg tgc cgc tgg ccc ggg gct ggc tgc ggc 1248
Arg Arg Arg Gly Trp Ala Arg Cys Arg Trp Pro Gly Ala Gly Cys Gly
405 410 415

agg ccg aca tcg acc gga acc ccc tct tct tca cca ccg tcc cgg gcg 1296
Arg Pro Thr Ser Thr Gly Thr Pro Ser Ser Ser Pro Pro Ser Arg Ala
420 425 430

cgg ccg gca ccg ggg tgc cgc tga 1320
Arg Pro Ala Pro Gly Cys Arg *
435



<210> 86 
<211> 439 
<212> PRT 
<213> Bacteria 



<400> 86 
Val Gly Met Arg Arg Ser Arg Val Val Ala Val Ala Ala Ala Ser Ala
1 5 10 15

Val Leu Leu Gly Val Thr Tyr Leu Ala Leu Pro Pro Thr Gly Ser Asp
20 25 30

Leu Ala Ala Gln Val Ala Arg Ala Asp Phe Phe Ala Ala His Gly Leu
35 40 45

Ala Pro Val Asp Leu Arg Trp Tyr Gly Gly Val Gln Gln Phe Gly Tyr
50 55 60

Ser Leu Val Ser Gln Pro Val Met Ala Leu Leu Gly Val Arg Val Thr
65 70 75 80

Gly Val Leu Ala Leu Val Ala Ala Ala Thr Ala Phe Ala Ala Leu Leu
85 90 95

Val Arg Thr Gly Val Pro Arg Pro Leu Leu Gly Ser Leu Val Gly Val
100 105 110

Val Thr Ile Ala Gly Asn Leu Val Ser Gly Arg Val Thr Tyr Gly Leu
115 120 125

Gly Val Ala Phe Gly Leu Gly Ala Leu Leu Ala Leu Thr Leu Pro Arg
130 135 140

Gly Pro Ala Ala Arg Ala Ala Asp Ser Asp Pro Ala Ala Pro Ala Asp
145 150 155 160

Ser Asp Ala Asp Gly Arg Ala Arg Arg Arg Gln Val Ala Arg Leu Gly
165 170 175

Leu Ala Val Ala Gly Ala Leu Leu Ala Ser Ala Ala Ser Pro Val Ala
180 185 190

Gly Leu Phe Val Gly Leu Ala Gly Ala Ala Leu Leu Leu Thr Arg Arg
195 200 205

Tyr Ala Asp Gly Leu Ala Leu Gly Val Ala Ala Ala Leu Pro Leu Gly
210 215 220

Ala Thr Ala Leu Leu Phe Gly Asp Gly Gly Trp Met Asn Ile Ser Arg
225 230 235 240

Thr Asp Thr Leu Arg Ala Val Leu Thr Ser Leu Leu Val Ala Ala Leu
245 250 255

Val Ala Tyr Arg Pro Val Arg Val Gly Ala Leu Leu Ser Ala Ala Gly
260 265 270

Val Leu Ala Ala Ala Leu Val His Thr Pro Val Gly Leu Asn Ala Thr
275 280 285

Arg Leu Ala Val Met Phe Gly Leu Pro Leu Leu Ala Ala Ala Ala Arg
290 295 300

Pro Pro Val Gly Leu Ala Arg Trp Trp Ala Arg Arg Gly Arg Gly Ala
305 310 315 320

Ala Arg Gly Gly Val Gly Gly Arg Asp Ala Ala Gln Gly Arg Ser Lys
325 330 335

Val Arg Gly Arg Val Ala Leu Ala Thr Leu Leu Ala Ala Gly Cys Trp
340 345 350

Trp Gln Pro Pro Val Pro Pro Ala Asp Leu Arg Ser Val Asp Asp Pro
355 360 365

Thr Gly Arg Ala Ala Tyr Leu Arg Arg Cys Gly Ser Ser Ser Thr Gly
370 375 380

Ser Gly Ser Pro Ala Gly Ser Arg Cys Arg Arg Pro Ala Thr Thr Gly
385 390 395 400

Arg Arg Arg Gly Trp Ala Arg Cys Arg Trp Pro Gly Ala Gly Cys Gly
405 410 415

Arg Pro Thr Ser Thr Gly Thr Pro Ser Ser Ser Pro Pro Ser Arg Ala
420 425 430

Arg Pro Ala Pro Gly Cys Arg
435



<210> 87 
<211> 1431 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1) . . . (1431) 

<400> 87 

atg tcc ggc gtg cct cac cac ctc gcg cgc tgg atc ggc ctg gcc ggc 48
Met Ser Gly Val Pro His His Leu Ala Arg Trp Ile Gly Leu Ala Gly
1 5 10 15

tcg acg ctg ctc gcc gtg gcc gcg ttc ctc ggc gga gcg ctg ccc gac 96
Ser Thr Leu Leu Ala Val Ala Ala Phe Leu Gly Gly Ala Leu Pro Asp
20 25 30

ggc gat ttg cgc ccc acc ccg ctc agc atc tgg cag ggc ccg cac ggc 144
Gly Asp Leu Arg Pro Thr Pro Leu Ser Ile Trp Gln Gly Pro His Gly
35 40 45

ccg ttg atc atc gcc acc tgg gcg gtc ggc acg ggc ctg atg gcg tac 192
Pro Leu Ile Ile Ala Thr Trp Ala Val Gly Thr Gly Leu Met Ala Tyr
50 55 60

gcc tgg tgg gcg ctg cgc gac cgg gtg ccg tcg acc cgc tgg gcc gtg 240
Ala Trp Trp Ala Leu Arg Asp Arg Val Pro Ser Thr Arg Trp Ala Val
65 70 75 80

gtc acc gcc ggg ctc tgg ctg ctg ccg ctg ctg gtc gcg ccg ccg ctg 288
Val Thr Ala Gly Leu Trp Leu Leu Pro Leu Leu Val Ala Pro Pro Leu
85 90 95

ggc agc cga gac gtc tac gcg tac gcc tgc cag ggc gcc agc tac tcc 336
Gly Ser Arg Asp Val Tyr Ala Tyr Ala Cys Gln Gly Ala Ser Tyr Ser
100 105 110



gcc ggc atc aac ccg tac gag cag ggt gtc tcg gca ctg ccc tgc ccg 384
Ala Gly Ile Asn Pro Tyr Glu Gln Gly Val Ser Ala Leu Pro Cys Pro
115 120 125

tgg ctg gac acc atc tcc tac atc tgg cgg gac acc tcg gcc ccg tac 432
Trp Leu Asp Thr Ile Ser Tyr Ile Trp Arg Asp Thr Ser Ala Pro Tyr
130 135 140

ggg ccg ctg ttc ctg ctg atc gcc ggg gcg gtg gtc gag gcg acc ggg 480
Gly Pro Leu Phe Leu Leu Ile Ala Gly Ala Val Val Glu Ala Thr Gly
145 150 155 160

tcg ctg acc ggc agc atc gtg ctg ttc cgg ctg ctg gcg gtg gcc ggg 528
Ser Leu Thr Gly Ser Ile Val Leu Phe Arg Leu Leu Ala Val Ala Gly
165 170 175

gtg ggg ctg acc gcg gcc tgc ctg ccg ccg ctg gcc cgg cgc tgc ggc 576
Val Gly Leu Thr Ala Ala Cys Leu Pro Pro Leu Ala Arg Arg Cys Gly
180 185 190

gtg ccg gcc ggc cgg gcg gtc tgg ctg gcg ctg ggc tcg ccg ctg atc 624
Val Pro Ala Gly Arg Ala Val Trp Leu Ala Leu Gly Ser Pro Leu Ile
195 200 205

ggg gtg cac ctg atc tcg ggc gcg cac aac gac gcg ctg atg gtg ggg 672
Gly Val His Leu Ile Ser Gly Ala His Asn Asp Ala Leu Met Val Gly
210 215 220

ctg ctc gtg gcc ggg ctg gcg atg gtg gtg gcc cgg ccg ggc cgc ccc 720
Leu Leu Val Ala Gly Leu Ala Met Val Val Ala Arg Pro Gly Arg Pro
225 230 235 240

ggc ccg ctg ctc gcc ggg gga gcg ctg ctc ggc ctc gcc ggc gcc gtc 768
Gly Pro Leu Leu Ala Gly Gly Ala Leu Leu Gly Leu Ala Gly Ala Val
245 250 255

aag gtc acc gcg ctg gtg gtg gtg ccg ttc gcg gcg ctc gcc gcg atc 816
Lys Val Thr Ala Leu Val Val Val Pro Phe Ala Ala Leu Ala Ala Ile
260 265 270

gtc ggg gcg tac tcg atc agg gcg ttg atc cgc gac ggt ggg tgg gtg 864
Val Gly Ala Tyr Ser Ile Arg Ala Leu Ile Arg Asp Gly Gly Trp Val
275 280 285

gtc ggc ggg gcg ctc gcg gcg gtc gtc ggc gcg acc ctc gcc agc ggc 912
Val Gly Gly Ala Leu Ala Ala Val Val Gly Ala Thr Leu Ala Ser Gly
290 295 300

ctg ggc ttc ggc tgg gtc acc ggg ctg gag cag ggc ggc ctg gtg atc 960
Leu Gly Phe Gly Trp Val Thr Gly Leu Glu Gln Gly Gly Leu Val Ile
305 310 315 320

gcc tgg acc tcg ccc ccg acg gcg gtg ggg cag acc gtc gcc tac ctc 1008
Ala Trp Thr Ser Pro Pro Thr Ala Val Gly Gln Thr Val Ala Tyr Leu
325 330 335

gcc gcg ccg ttc ggc tgg cac ggc gat ccg ctg ccg gtc acc cgg ggc 1056
Ala Ala Pro Phe Gly Trp His Gly Asp Pro Leu Pro Val Thr Arg Gly
340 345 350

atc ggg atg gcc gtg ctc gcg ctg gtg ctg atc tgg ctg tgg tgg cgg 1104
Ile Gly Met Ala Val Leu Ala Leu Val Leu Ile Trp Leu Trp Trp Arg
355 360 365

gcc cgc acc cgg gag ccg ctg tgg cac gcc ggc ctg gcg ctg gcc gcc 1152
Ala Arg Thr Arg Glu Pro Leu Trp His Ala Gly Leu Ala Leu Ala Ala
370 375 380

acg gtc gcg ctc gcc ccg ctg ttc cac ccc tgg tac tgg acc tgg ccg 1200
Thr Val Ala Leu Ala Pro Leu Phe His Pro Trp Tyr Trp Thr Trp Pro
385 390 395 400

ctg gcc gtg ctc gcg gcc acg tcg cgg cgc acc ggc tgg ttc gcg ctc 1248
Leu Ala Val Leu Ala Ala Thr Ser Arg Arg Thr Gly Trp Phe Ala Leu
405 410 415

gtc gcg gtg ctc tcg gcg ttc ctg gtc ctc gcg gac ggc acc ggg ctg 1296
Val Ala Val Leu Ser Ala Phe Leu Val Leu Ala Asp Gly Thr Gly Leu
420 425 430

gcc cgg tac agc aag acg gtc ggc gcc ccg ctg atg acg ctg ttg gtg 1344
Ala Arg Tyr Ser Lys Thr Val Gly Ala Pro Leu Met Thr Leu Leu Val
435 440 445

atg gtg gtg gcc gtc cgc ttg gta cgg tcg gct tgg gcg gcc cgc cgg 1392
Met Val Val Ala Val Arg Leu Val Arg Ser Ala Trp Ala Ala Arg Arg
450 455 460

tcg gct cgg gcg gcc cgc cgg ccg gcc gcc gtg aac tga 1431
Ser Ala Arg Ala Ala Arg Arg Pro Ala Ala Val Asn *
465 470 475



<210> 88 
<211> 476 
<212> PRT 
<213> Bacteria 



<400> 88 

Met Ser Gly Val Pro His His Leu Ala Arg Trp Ile Gly Leu Ala Gly
1 5 10 15

Ser Thr Leu Leu Ala Val Ala Ala Phe Leu Gly Gly Ala Leu Pro Asp
20 25 30

Gly Asp Leu Arg Pro Thr Pro Leu Ser Ile Trp Gln Gly Pro His Gly
35 40 45

Pro Leu Ile Ile Ala Thr Trp Ala Val Gly Thr Gly Leu Met Ala Tyr
50 55 60

Ala Trp Trp Ala Leu Arg Asp Arg Val Pro Ser Thr Arg Trp Ala Val
65 70 75 80

Val Thr Ala Gly Leu Trp Leu Leu Pro Leu Leu Val Ala Pro Pro Leu
85 90 95

Gly Ser Arg Asp Val Tyr Ala Tyr Ala Cys Gln Gly Ala Ser Tyr Ser
100 105 110

Ala Gly Ile Asn Pro Tyr Glu Gln Gly Val Ser Ala Leu Pro Cys Pro
115 120 125

Trp Leu Asp Thr Ile Ser Tyr Ile Trp Arg Asp Thr Ser Ala Pro Tyr
130 135 140

Gly Pro Leu Phe Leu Leu Ile Ala Gly Ala Val Val Glu Ala Thr Gly
145 150 155 160

Ser Leu Thr Gly Ser Ile Val Leu Phe Arg Leu Leu Ala Val Ala Gly
165 170 175

Val Gly Leu Thr Ala Ala Cys Leu Pro Pro Leu Ala Arg Arg Cys Gly
180 185 190

Val Pro Ala Gly Arg Ala Val Trp Leu Ala Leu Gly Ser Pro Leu Ile
195 200 205

Gly Val His Leu Ile Ser Gly Ala His Asn Asp Ala Leu Met Val Gly
210 215 220

Leu Leu Val Ala Gly Leu Ala Met Val Val Ala Arg Pro Gly Arg Pro
225 230 235 240

Gly Pro Leu Leu Ala Gly Gly Ala Leu Leu Gly Leu Ala Gly Ala Val
245 250 255

Lys Val Thr Ala Leu Val Val Val Pro Phe Ala Ala Leu Ala Ala Ile
260 265 270

Val Gly Ala Tyr Ser Ile Arg Ala Leu Ile Arg Asp Gly Gly Trp Val
275 280 285

Val Gly Gly Ala Leu Ala Ala Val Val Gly Ala Thr Leu Ala Ser Gly
290 295 300

Leu Gly Phe Gly Trp Val Thr Gly Leu Glu Gln Gly Gly Leu Val Ile
305 310 315 320

Ala Trp Thr Ser Pro Pro Thr Ala Val Gly Gln Thr Val Ala Tyr Leu
325 330 335

Ala Ala Pro Phe Gly Trp His Gly Asp Pro Leu Pro Val Thr Arg Gly
340 345 350

Ile Gly Met Ala Val Leu Ala Leu Val Leu Ile Trp Leu Trp Trp Arg
355 360 365

Ala Arg Thr Arg Glu Pro Leu Trp His Ala Gly Leu Ala Leu Ala Ala
370 375 380

Thr Val Ala Leu Ala Pro Leu Phe His Pro Trp Tyr Trp Thr Trp Pro
385 390 395 400

Leu Ala Val Leu Ala Ala Thr Ser Arg Arg Thr Gly Trp Phe Ala Leu
405 410 415

Val Ala Val Leu Ser Ala Phe Leu Val Leu Ala Asp Gly Thr Gly Leu
420 425 430

Ala Arg Tyr Ser Lys Thr Val Gly Ala Pro Leu Met Thr Leu Leu Val
435 440 445

Met Val Val Ala Val Arg Leu Val Arg Ser Ala Trp Ala Ala Arg Arg
450 455 460

Ser Ala Arg Ala Ala Arg Arg Pro Ala Ala Val Asn
465 470 475



<210> 89 
<211> 1509 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1)...(1509)

<400> 89 

gtg acc aca ccc ggc tcc ccg tcg acc tcg ccc gac gtc tcg ccg tcg 48
Val Thr Thr Pro Gly Ser Pro Ser Thr Ser Pro Asp Val Ser Pro Ser
1 5 10 15

ccg gat gcc gcc cgg ctc gcc cgg tac gcg ggc ctg ggc ggg gcg gtg 96
Pro Asp Ala Ala Arg Leu Ala Arg Tyr Ala Gly Leu Gly Gly Ala Val
20 25 30

ctg ttg gcc gtc gcc ggc tgg cgg ggc ggg gcg ctg ccg tcg acc ccg 144
Leu Leu Ala Val Ala Gly Trp Arg Gly Gly Ala Leu Pro Ser Thr Pro
35 40 45

ctg gac gtc ccc ccg ggg gac cgt tgg ctg tcg gac ggt ggg ccg ctg 192
Leu Asp Val Pro Pro Gly Asp Arg Trp Leu Ser Asp Gly Gly Pro Leu
50 55 60

acg ctg ggg gtc tgg ctg gtc ggc acg gcc ctg ctg gtc ggc gcc tgg 240
Thr Leu Gly Val Trp Leu Val Gly Thr Ala Leu Leu Val Gly Ala Trp
65 70 75 80

tgg gcg ctg cgc cgg ggc gcg ccg tcc acg cgg tgg gcg tac ctg acc 288
Trp Ala Leu Arg Arg Gly Ala Pro Ser Thr Arg Trp Ala Tyr Leu Thr
85 90 95

gcc ggg ctg tgg gcg ctg ccg ctg ctg gtc acc ccg ccg ctg ggc agc 336
Ala Gly Leu Trp Ala Leu Pro Leu Leu Val Thr Pro Pro Leu Gly Ser
100 105 110

cgg gac gtc tac tcc tac gcc tgc cag ggc tgg gcg tac gcg cac ggc 384
Arg Asp Val Tyr Ser Tyr Ala Cys Gln Gly Trp Ala Tyr Ala His Gly
115 120 125

gtc gac ccg tac gcg acc ggg gtg gcc gag gcc ggc tgc ccc tgg gtg 432
Val Asp Pro Tyr Ala Thr Gly Val Ala Glu Ala Gly Cys Pro Trp Val
130 135 140

gag tcg gtc gcg ccg atc tgg cgg gac acg ccc gcc ccg tac ggg ccg 480
Glu Ser Val Ala Pro Ile Trp Arg Asp Thr Pro Ala Pro Tyr Gly Pro
145 150 155 160

ttc ttc gtg ctg ctc gcc gcg ctc gcg gtg acc ctc ggc ggc ggc ctg 528
Phe Phe Val Leu Leu Ala Ala Leu Ala Val Thr Leu Gly Gly Gly Leu
165 170 175

gtg ggc gct gtc gtg gcg ttc cgc ctg ctc gcg gtc gcc ggg gtg ttg 576
Val Gly Ala Val Val Ala Phe Arg Leu Leu Ala Val Ala Gly Val Leu
180 185 190

ctg gcc gcc ctc tgc ctg gtg ggc ctg gcc cgc gcc gcg ggc gtg ccc 624
Leu Ala Ala Leu Cys Leu Val Gly Leu Ala Arg Ala Ala Gly Val Pro
195 200 205

acc cgc agg gcg gcc tgg ctg gcg ctg gcc tgc ccg ctg gtc ggg gtc 672
Thr Arg Arg Ala Ala Trp Leu Ala Leu Ala Cys Pro Leu Val Gly Val
210 215 220

cac ctg gtg gcc ggc gcg cac aac gac gcg gtg atg ctc ggc ctg ctg 720
His Leu Val Ala Gly Ala His Asn Asp Ala Val Met Leu Gly Leu Leu
225 230 235 240


ctg ctg ggc ctg ctg gtg ctg gtg cgc ggg ccc ggc aag ccg aag ccg 768
Leu Leu Gly Leu Leu Val Leu Val Arg Gly Pro Gly Lys Pro Lys Pro
245 250 255

ctg ttg gtg gcc ggg gcc ctg ctc ggg ctg gcg gtg acg gtg aag gcc 816
Leu Leu Val Ala Gly Ala Leu Leu Gly Leu Ala Val Thr Val Lys Ala
260 265 270

acc gcc gtg gtg gtg ctt ccc ttc gcg gcg ctg gcc gcg gtg ctg ggc 864
Thr Ala Val Val Val Leu Pro Phe Ala Ala Leu Ala Ala Val Leu Gly
275 280 285

cgc tac acc gtg cgg gcg ctg ctg cgc gac gcc ggc tgg ctg gcc ggc 912
Arg Tyr Thr Val Arg Ala Leu Leu Arg Asp Ala Gly Trp Leu Ala Gly
290 295 300

ggg acg ctc ggc gcg gtg ggg gtc acc tcg ctg ctg tcc ggc ctc gga 960
Gly Thr Leu Gly Ala Val Gly Val Thr Ser Leu Leu Ser Gly Leu Gly
305 310 315 320

ctc ggc tgg ata cgc ggg ctg acc cgc agc ggg gac tcc gag cag tgg 1008
Leu Gly Trp Ile Arg Gly Leu Thr Arg Ser Gly Asp Ser Glu Gln Trp
325 330 335

acg tcg ccc ccg acg gcg gtg ggc ttc gtc gtc gac tac gcg ggc gag 1056
Thr Ser Pro Pro Thr Ala Val Gly Phe Val Val Asp Tyr Ala Gly Glu
340 345 350

ctc gcc ggg cgg gac ccg ggc gcg gtg ccg gcg acc cgc gcg gcg gcg 1104
Leu Ala Gly Arg Asp Pro Gly Ala Val Pro Ala Thr Arg Ala Ala Ala
355 360 365

ctg ctg ctg ctc gcc gtg ctc gtg gcg gcg ctg tgg tgg cgg gcc tgg 1152
Leu Leu Leu Leu Ala Val Leu Val Ala Ala Leu Trp Trp Arg Ala Trp
370 375 380

tcg ggg ctg cgc cgg ctg aac gac gtc cgg cag cgg gtg gcc cgc ctg 1200
Ser Gly Leu Arg Arg Leu Asn Asp Val Arg Gln Arg Val Ala Arg Leu
385 390 395 400

gac gcc gcc cgc ccc cgg gtg acc ctg ctc ggc gcg ggg ctg gcg ctg 1248
Asp Ala Ala Arg Pro Arg Val Thr Leu Leu Gly Ala Gly Leu Ala Leu
405 410 415

gcc gcc acg gtc ctc ctc gcc ccg gtc ttc cac ccc tgg tac gcc acc 1296
Ala Ala Thr Val Leu Leu Ala Pro Val Phe His Pro Trp Tyr Ala Thr
420 425 430

tgg ccg ctg gcc ctg ctc gcg gtc gcc gcg acg cgg acc acc tgg ttc 1344
Trp Pro Leu Ala Leu Leu Ala Val Ala Ala Thr Arg Thr Thr Trp Phe
435 440 445

gtg gcg ccc tgc gcg gcg gcg gcc ttc ctc acc ctg ccc gac ggc acc 1392
Val Ala Pro Cys Ala Ala Ala Ala Phe Leu Thr Leu Pro Asp Gly Thr
450 455 460

aac ctg gcc cgg ttc acc aag gcc ccg ggc gcg atc gcg atg acc gcg 1440
Asn Leu Ala Arg Phe Thr Lys Ala Pro Gly Ala Ile Ala Met Thr Ala
465 470 475 480

ctg gtg gcc ggg ctg gcg gtg tgg ggc ctg ctc cgg ctg cgc cgg acc 1488
Leu Val Ala Gly Leu Ala Val Trp Gly Leu Leu Arg Leu Arg Arg Thr
485 490 495

cgt gcc gcg cgc ccc ggc tga 1509
Arg Ala Ala Arg Pro Gly *
500



<210> 90 
<211> 502 
<212> PRT 
<213> Bacteria 

<400> 90 

Val Thr Thr Pro Gly Ser Pro Ser Thr Ser Pro Asp Val Ser Pro Ser
1 5 10 15

Pro Asp Ala Ala Arg Leu Ala Arg Tyr Ala Gly Leu Gly Gly Ala Val
20 25 30

Leu Leu Ala Val Ala Gly Trp Arg Gly Gly Ala Leu Pro Ser Thr Pro
35 40 45

Leu Asp Val Pro Pro Gly Asp Arg Trp Leu Ser Asp Gly Gly Pro Leu
50 55 60

Thr Leu Gly Val Trp Leu Val Gly Thr Ala Leu Leu Val Gly Ala Trp
65 70 75 80

Trp Ala Leu Arg Arg Gly Ala Pro Ser Thr Arg Trp Ala Tyr Leu Thr
85 90 95

Ala Gly Leu Trp Ala Leu Pro Leu Leu Val Thr Pro Pro Leu Gly Ser
100 105 110

Arg Asp Val Tyr Ser Tyr Ala Cys Gln Gly Trp Ala Tyr Ala His Gly
115 120 125

Val Asp Pro Tyr Ala Thr Gly Val Ala Glu Ala Gly Cys Pro Trp Val
130 135 140

Glu Ser Val Ala Pro Ile Trp Arg Asp Thr Pro Ala Pro Tyr Gly Pro
145 150 155 160

Phe Phe Val Leu Leu Ala Ala Leu Ala Val Thr Leu Gly Gly Gly Leu
165 170 175

Val Gly Ala Val Val Ala Phe Arg Leu Leu Ala Val Ala Gly Val Leu
180 185 190

Leu Ala Ala Leu Cys Leu Val Gly Leu Ala Arg Ala Ala Gly Val Pro
195 200 205

Thr Arg Arg Ala Ala Trp Leu Ala Leu Ala Cys Pro Leu Val Gly Val
210 215 220

His Leu Val Ala Gly Ala His Asn Asp Ala Val Met Leu Gly Leu Leu
225 230 235 240

Leu Leu Gly Leu Leu Val Leu Val Arg Gly Pro Gly Lys Pro Lys Pro
245 250 255

Leu Leu Val Ala Gly Ala Leu Leu Gly Leu Ala Val Thr Val Lys Ala
260 265 270

Thr Ala Val Val Val Leu Pro Phe Ala Ala Leu Ala Ala Val Leu Gly
275 280 285

Arg Tyr Thr Val Arg Ala Leu Leu Arg Asp Ala Gly Trp Leu Ala Gly
290 295 300

Gly Thr Leu Gly Ala Val Gly Val Thr Ser Leu Leu Ser Gly Leu Gly
305 310 315 320

Leu Gly Trp Ile Arg Gly Leu Thr Arg Ser Gly Asp Ser Glu Gln Trp
325 330 335

Thr Ser Pro Pro Thr Ala Val Gly Phe Val Val Asp Tyr Ala Gly Glu
340 345 350

Leu Ala Gly Arg Asp Pro Gly Ala Val Pro Ala Thr Arg Ala Ala Ala
355 360 365

Leu Leu Leu Leu Ala Val Leu Val Ala Ala Leu Trp Trp Arg Ala Trp
370 375 380

Ser Gly Leu Arg Arg Leu Asn Asp Val Arg Gln Arg Val Ala Arg Leu
385 390 395 400

Asp Ala Ala Arg Pro Arg Val Thr Leu Leu Gly Ala Gly Leu Ala Leu
405 410 415

Ala Ala Thr Val Leu Leu Ala Pro Val Phe His Pro Trp Tyr Ala Thr
420 425 430

Trp Pro Leu Ala Leu Leu Ala Val Ala Ala Thr Arg Thr Thr Trp Phe
435 440 445

Val Ala Pro Cys Ala Ala Ala Ala Phe Leu Thr Leu Pro Asp Gly Thr
450 455 460

Asn Leu Ala Arg Phe Thr Lys Ala Pro Gly Ala Ile Ala Met Thr Ala
465 470 475 480

Leu Val Ala Gly Leu Ala Val Trp Gly Leu Leu Arg Leu Arg Arg Thr
485 490 495

Arg Ala Ala Arg Pro Gly
500



<210> 91 
<211> 750 
<212> DNA 
<213> Bacteria 

<220> 

<221> CDS 

<222> (1)...(750)

<400> 91 

atg agc aca gcc gag gaa tcg ttg ccg ggc aac gcc acc acc ggc gtg 48
Met Ser Thr Ala Glu Glu Ser Leu Pro Gly Asn Ala Thr Thr Gly Val
1 5 10 15

gtg cgc gtc ggc gac acc gtg cgc cgt ccg gtc ggc ccc tgg agc gac 96
Val Arg Val Gly Asp Thr Val Arg Arg Pro Val Gly Pro Trp Ser Asp
20 25 30

gtg gtg gac gcc ctg ctg gaa cac ctg cac gcg gtg gga ttc gcc ggt 144
Val Val Asp Ala Leu Leu Glu His Leu His Ala Val Gly Phe Ala Gly
35 40 45

gcc ccc cgg cct ctg ggt cgc gac gcg cag ggc cgg cag gtg ctg gag 192
Ala Pro Arg Pro Leu Gly Arg Asp Ala Gln Gly Arg Gln Val Leu Glu
50 55 60

tac gtc cca ggc gag gtc ggc gag gcg tcg ggc acg tac ccg gtg gcg 240
Tyr Val Pro Gly Glu Val Gly Glu Ala Ser Gly Thr Tyr Pro Val Ala
65 70 75 80

gac ctg ttc gcg atc ggc cgg atg ctg gcc gag ctg cac gag gcg ctg 288
Asp Leu Phe Ala Ile Gly Arg Met Leu Ala Glu Leu His Glu Ala Leu
85 90 95

gcc ggg ttc acc ccg ccg gcc ggc gcg gcg tgg cag cgg ctc atc ccg 336
Ala Gly Phe Thr Pro Pro Ala Gly Ala Ala Trp Gln Arg Leu Ile Pro
100 105 110

ccg gac cgg gag gaa ctc gtc tgc cac aac gac gtg gcc ccg tgg aac 384
Pro Asp Arg Glu Glu Leu Val Cys His Asn Asp Val Ala Pro Trp Asn
115 120 125

ctg atc agg gcg gac cgg ggc tgg gtg ctg atc gac tgg gac tgc gcg 432
Leu Ile Arg Ala Asp Arg Gly Trp Val Leu Ile Asp Trp Asp Cys Ala
130 135 140

gcg ccg ggc tcc cgg ctc tgg gac ctc gcg tac gcc gcg cag agc atg 480
Ala Pro Gly Ser Arg Leu Trp Asp Leu Ala Tyr Ala Ala Gln Ser Met
145 150 155 160

gcc ggc ctg cgc ccg gac cgg ccg gtg gcc gag tcg gcg gcc cgg ctg 528
Ala Gly Leu Arg Pro Asp Arg Pro Val Ala Glu Ser Ala Ala Arg Leu
165 170 175

cgc gcc ttc gcc gac ggc tac cgg ctg gac gag gcg tcc cgc ccg gcc 576
Arg Ala Phe Ala Asp Gly Tyr Arg Leu Asp Glu Ala Ser Arg Pro Ala
180 185 190

ctg gcc gcc atg ctg ggt cgc cgc gcc cgg gcc atg tac gac ctg ttg 624
Leu Ala Ala Met Leu Gly Arg Arg Ala Arg Ala Met Tyr Asp Leu Leu
195 200 205

cgc gag ggc gcg gaa cag cgg cgc gag ccg tgg gcc cgg atc tgg acc 672
Arg Glu Gly Ala Glu Gln Arg Arg Glu Pro Trp Ala Arg Ile Trp Thr
210 215 220

gag gac ggc ccg tac tgg ctg gcc acc gcc gaa cac ctc gac gcc cac 720
Glu Asp Gly Pro Tyr Trp Leu Ala Thr Ala Glu His Leu Asp Ala His
225 230 235 240

acc gag gca tgg gag atc gcc ctg cgc tga 750
Thr Glu Ala Trp Glu Ile Ala Leu Arg *
245



<210> 92 
<211> 249 
<212> PRT 
<213> Bacteria 

<400> 92 

Met Ser Thr Ala Glu Glu Ser Leu Pro Gly Asn Ala Thr Thr Gly Val
1 5 10 15

Val Arg Val Gly Asp Thr Val Arg Arg Pro Val Gly Pro Trp Ser Asp
20 25 30

Val Val Asp Ala Leu Leu Glu His Leu His Ala Val Gly Phe Ala Gly
35 40 45

Ala Pro Arg Pro Leu Gly Arg Asp Ala Gln Gly Arg Gln Val Leu Glu
50 55 60

Tyr Val Pro Gly Glu Val Gly Glu Ala Ser Gly Thr Tyr Pro Val Ala
65 70 75 80

Asp Leu Phe Ala Ile Gly Arg Met Leu Ala Glu Leu His Glu Ala Leu
85 90 95

Ala Gly Phe Thr Pro Pro Ala Gly Ala Ala Trp Gln Arg Leu Ile Pro
100 105 110

Pro Asp Arg Glu Glu Leu Val Cys His Asn Asp Val Ala Pro Trp Asn
115 120 125

Leu Ile Arg Ala Asp Arg Gly Trp Val Leu Ile Asp Trp Asp Cys Ala
130 135 140

Ala Pro Gly Ser Arg Leu Trp Asp Leu Ala Tyr Ala Ala Gln Ser Met
145 150 155 160

Ala Gly Leu Arg Pro Asp Arg Pro Val Ala Glu Ser Ala Ala Arg Leu
165 170 175

Arg Ala Phe Ala Asp Gly Tyr Arg Leu Asp Glu Ala Ser Arg Pro Ala
180 185 190

Leu Ala Ala Met Leu Gly Arg Arg Ala Arg Ala Met Tyr Asp Leu Leu
195 200 205

Arg Glu Gly Ala Glu Gln Arg Arg Glu Pro Trp Ala Arg Ile Trp Thr
210 215 220

Glu Asp Gly Pro Tyr Trp Leu Ala Thr Ala Glu His Leu Asp Ala His
225 230 235 240

Thr Glu Ala Trp Glu Ile Ala Leu Arg
245



<210> 93 
<211> 1315 
<212> DNA 
<213> Bacteria 

<400> 93 

catcctccct cgcctaaagg cgggggagtc cgaccctcgc gggttggggt tcctggttca 60
ccgcagaccg cacggaagga ggtccttcgt gtctgacgtc cgctccgcag gcgtttttcg 120
tctcggccag cccggccgcg acggtgatgt tcttggctgc gttgacgtcc cggtcatgcc 180
gggtgccgca actcggacac gtccagtggc gtgtgccgag ggagagtgtg gcgagcaggt 240
gcccgcacgc cgagcaggtc ttcgacgacg ggtaccagcg gtccaccacc gcgagggtgc 300
ggccgtcgcg gtgcgccttg taggtgagca gggtgcggaa ctcggcccag ccggtacgcg 360
agatcgcttt ggccagggag tggttgcgga ccatgttcgc cacggccagg tcctccacgg 420
cgatggcggc gaaccggcgc accagggcgg tggactgctg gtggaggaag tcccggcggg 480
cgtcgcgcac ctgcgaatgc gctcgggcga ccattcgttt ggctttggcg cggttggcgg 540
agcccttctg tcggcgggcc attatccgct gataccgctt gagtcggcgt tcccgccgtt 600
ccatgtgctt cgggtggggg atgcgttcgc cggtggacag caccgcgaag tcggtcaggc 660
cgaggtccac gcccaccgcc tcgccggtgg gttcgggtgc ggcgggtgtg tcgacgtcga 720
cggcgaaggt cacgaaccag cggccgtccg ggtcacgcga caccgtcacc atcgtcggat 780
ccaaccccgc cggatccacg ttcggcaacg accacacgaa ccgcagcacc ccgggtgtct 840
ttcccaacga caggttcccg ctgcggaggc ggaacgccga ccgggtgtaa ctggcggact 900
ggcggccgtg tcgggacttg tagcgcgggt accgggcccg cttggcgaag aaggcggtca 960
tggcggtgtg ctggtgccgc agggtctgct gcaacggcac cgacgacacc tcacccagat 1020
acgccaggtc gggctgcttc ttcatctccg tcaacgcccg atcggtctcc gcgtaggagg 1080
tggatctccg ttcggtgtgc cagcgggcgt gacgggcggc gagcgtgcgg ttccagacga 1140
cacgtacaca cccgaacgtg cggttcagca ccgccgcctg ctccggggtc gggtacgccc 1200
gacacctgta cgccgtccgc acaggaccag ccctaccaga aaggacagtc gtggctgaca 1260
acgcatccgc cgttcgtccc cgccctgaag gacgtggcat cctggcggtg atccg 1315

<210> 94 
<211> 1263 
<212> DNA 
<213> Bacteria 
<220>

<221> CDS 

<222> (1)...(1263)



<400> 94 

gtg cta gat atg act caa gta gac ggg tcg ccc ctg cca act ctg gaa 48
Val Leu Asp Met Thr Gln Val Asp Gly Ser Pro Leu Pro Thr Leu Glu
1 5 10 15

agg caa gtg atc acc gtg cgt gtg ctg ttc gcc agt ctc gga acc cat 96
Arg Gln Val Ile Thr Val Arg Val Leu Phe Ala Ser Leu Gly Thr His
20 25 30

ggc cac acc tac ccc ctg ctg cca ctg gcc acg gcc gcc cgt gcg gcg 144
Gly His Thr Tyr Pro Leu Leu Pro Leu Ala Thr Ala Ala Arg Ala Ala
35 40 45

ggc cac gag gtc acc ttc gcc acc ggc gag ggc ttc gcg ggc acc ctg 192
Gly His Glu Val Thr Phe Ala Thr Gly Glu Gly Phe Ala Gly Thr Leu
50 55 60

cgg aag ctg ggc ttc gag ccg gtc gcg acc ggg atg ccg gtc ttc gac 240
Arg Lys Leu Gly Phe Glu Pro Val Ala Thr Gly Met Pro Val Phe Asp
65 70 75 80

ggg ttc ctg gcg gcg ctg cgg atc cgc ttc gac acc gac agc ccc gag 288
Gly Phe Leu Ala Ala Leu Arg Ile Arg Phe Asp Thr Asp Ser Pro Glu
85 90 95

ggg ctg acc ccc gag cag ctc agt gag ctg ccg cag atc gtg ttc ggg 336
Gly Leu Thr Pro Glu Gln Leu Ser Glu Leu Pro Gln Ile Val Phe Gly
100 105 110

cgg gtc atc ccg cag cgc gtc ttc gac gag ctc cag ccg gtg atc gaa 384
Arg Val Ile Pro Gln Arg Val Phe Asp Glu Leu Gln Pro Val Ile Glu
115 120 125

cgg ttg cga ccc gac ctc gtg gtg cag gag atc agc aac tac ggc gcc 432
Arg Leu Arg Pro Asp Leu Val Val Gln Glu Ile Ser Asn Tyr Gly Ala
130 135 140

ggc ctg gcc gcc ctg aag gcg ggc atc ccg acc atc tgc cac ggg gtc 480
Gly Leu Ala Ala Leu Lys Ala Gly Ile Pro Thr Ile Cys His Gly Val
145 150 155 160

ggc cgg gac acg ccg gac gac ctg acc cgg tcc atc gag gag gag gtg 528
Gly Arg Asp Thr Pro Asp Asp Leu Thr Arg Ser Ile Glu Glu Glu Val
165 170 175

cgg ggg ctg gcc cag cgg ctc ggc ctc gac ctg ccg ccc ggg cgc atc 576
Arg Gly Leu Ala Gln Arg Leu Gly Leu Asp Leu Pro Pro Gly Arg Ile
180 185 190

gac ggc ttc ggc aac ccc ttc atc gac atc ttc ccg ccg tcg ctg cag 624
Asp Gly Phe Gly Asn Pro Phe Ile Asp Ile Phe Pro Pro Ser Leu Gln
195 200 205

gag ccg gag ttc cgg gcc cgc ccg cgg cgc cac gag ctg cgc ccg gtg 672
Glu Pro Glu Phe Arg Ala Arg Pro Arg Arg His Glu Leu Arg Pro Val
210 215 220

ccc ttc gcc gag cag ggt gac ctc ccg gcc tgg ctg tcc tcg cgc gac 720
Pro Phe Ala Glu Gln Gly Asp Leu Pro Ala Trp Leu Ser Ser Arg Asp
225 230 235 240

acg gcc cgc ccg ctg gtc tac ctg acg ctc ggc acg tcc agc ggc ggc 768
Thr Ala Arg Pro Leu Val Tyr Leu Thr Leu Gly Thr Ser Ser Gly Gly
245 250 255

acc gtc gag gtg ctg cgg gcg gcg atc gac ggg ctc gcc ggc ctc gac 816
Thr Val Glu Val Leu Arg Ala Ala Ile Asp Gly Leu Ala Gly Leu Asp
260 265 270

gcc gac gtc ctg gtc gcc agc ggc ccg tcc ctc gac gtc agc gga ctg 864
Ala Asp Val Leu Val Ala Ser Gly Pro Ser Leu Asp Val Ser Gly Leu
275 280 285

ggc gag gtg ccg gca aac gta cgg ctc gag tcg tgg gtg ccg cag gcg 912
Gly Glu Val Pro Ala Asn Val Arg Leu Glu Ser Trp Val Pro Gln Ala
290 295 300

gcc ctg ctg ccc cac gtc gac ctg gtg gtg cac cac ggg ggc agc ggc 960
Ala Leu Leu Pro His Val Asp Leu Val Val His His Gly Gly Ser Gly
305 310 315 320

act acg ctc ggc gca ctg ggc gcc ggc gtg ccg cag ctg tcc ttc ccg 1008
Thr Thr Leu Gly Ala Leu Gly Ala Gly Val Pro Gln Leu Ser Phe Pro
325 330 335

tgg gcg ggg gac tcg ttc gcc aac gcg cag gcg gtg gcg cag gcc ggc 1056
Trp Ala Gly Asp Ser Phe Ala Asn Ala Gln Ala Val Ala Gln Ala Gly
340 345 350

gcc ggt gac cac ctg ctg ccc gac aac atc agc ccc gac tcg gtg tcg 1104
Ala Gly Asp His Leu Leu Pro Asp Asn Ile Ser Pro Asp Ser Val Ser
355 360 365

ggc gcc gcg aag cgc ctg ttg gcc gag gag agc tac cgg gcc ggg gcg 1152
Gly Ala Ala Lys Arg Leu Leu Ala Glu Glu Ser Tyr Arg Ala Gly Ala
370 375 380

cgg gcc gtg gcg gcc gag atc gcg gcc atg ccg ggc ccc gac gag gtc 1200
Arg Ala Val Ala Ala Glu Ile Ala Ala Met Pro Gly Pro Asp Glu Val
385 390 395 400

gtc cgc ctg ctg ccg ggc ttc gcc tcc agg agc gcg ggc tga ccg gcg 1248
Val Arg Leu Leu Pro Gly Phe Ala Ser Arg Ser Ala Gly * Pro Ala
405 410 415

cta cgt ctg ccg tag 1263
Leu Arg Leu Pro *



<210> 95 
<211> 419 
<212> PRT
<213> Bacteria 



<400> 95 
Val Leu Asp Met Thr Gln Val Asp Gly Ser Pro Leu Pro Thr Leu Glu
1 5 10 15

Arg Gln Val Ile Thr Val Arg Val Leu Phe Ala Ser Leu Gly Thr His
20 25 30

Gly His Thr Tyr Pro Leu Leu Pro Leu Ala Thr Ala Ala Arg Ala Ala
35 40 45

Gly His Glu Val Thr Phe Ala Thr Gly Glu Gly Phe Ala Gly Thr Leu
50 55 60

Arg Lys Leu Gly Phe Glu Pro Val Ala Thr Gly Met Pro Val Phe Asp
65 70 75 80

Gly Phe Leu Ala Ala Leu Arg Ile Arg Phe Asp Thr Asp Ser Pro Glu
85 90 95

Gly Leu Thr Pro Glu Gln Leu Ser Glu Leu Pro Gln Ile Val Phe Gly
100 105 110

Arg Val Ile Pro Gln Arg Val Phe Asp Glu Leu Gln Pro Val Ile Glu
115 120 125

Arg Leu Arg Pro Asp Leu Val Val Gln Glu Ile Ser Asn Tyr Gly Ala
130 135 140

Gly Leu Ala Ala Leu Lys Ala Gly Ile Pro Thr Ile Cys His Gly Val
145 150 155 160

Gly Arg Asp Thr Pro Asp Asp Leu Thr Arg Ser Ile Glu Glu Glu Val
165 170 175

Arg Gly Leu Ala Gln Arg Leu Gly Leu Asp Leu Pro Pro Gly Arg Ile
180 185 190

Asp Gly Phe Gly Asn Pro Phe Ile Asp Ile Phe Pro Pro Ser Leu Gln
195 200 205

Glu Pro Glu Phe Arg Ala Arg Pro Arg Arg His Glu Leu Arg Pro Val
210 215 220

Pro Phe Ala Glu Gln Gly Asp Leu Pro Ala Trp Leu Ser Ser Arg Asp
225 230 235 240

Thr Ala Arg Pro Leu Val Tyr Leu Thr Leu Gly Thr Ser Ser Gly Gly
245 250 255

Thr Val Glu Val Leu Arg Ala Ala Ile Asp Gly Leu Ala Gly Leu Asp
260 265 270

Ala Asp Val Leu Val Ala Ser Gly Pro Ser Leu Asp Val Ser Gly Leu
275 280 285

Gly Glu Val Pro Ala Asn Val Arg Leu Glu Ser Trp Val Pro Gln Ala
290 295 300

Ala Leu Leu Pro His Val Asp Leu Val Val His His Gly Gly Ser Gly
305 310 315 320

Thr Thr Leu Gly Ala Leu Gly Ala Gly Val Pro Gln Leu Ser Phe Pro
325 330 335

Trp Ala Gly Asp Ser Phe Ala Asn Ala Gln Ala Val Ala Gln Ala Gly
340 345 350

Ala Gly Asp His Leu Leu Pro Asp Asn Ile Ser Pro Asp Ser Val Ser
355 360 365

Gly Ala Ala Lys Arg Leu Leu Ala Glu Glu Ser Tyr Arg Ala Gly Ala
370 375 380

Arg Ala Val Ala Ala Glu Ile Ala Ala Met Pro Gly Pro Asp Glu Val
385 390 395 400

Val Arg Leu Leu Pro Gly Phe Ala Ser Arg Ser Ala Gly Pro Ala Leu
405 410 415

Arg Leu Pro
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